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Atty. Docket No. 25436/1560 

COMPOSITIONS AND METHODS UTILIZING DNA POLYMERASES 
ABSTRACT OF THE DISCLOSURE 

The invention features a novel isolated Family B DNA polymerase, a Thermococcus 
polymerase JDF-3, and mutant recombinant forms thereof. Mutant polymerases of the invention 
are deficient in 3' to 5' exonuclease activity and/or exhibit reduced discrimination against non- 
conventional nucleotides relative to the wild-type form of the polymerase. 

FIELD OF THE INVENTION 

The present invention relates to compositions and methods utilizing DNA polymerase 
enzymes with reduced discrimination for non-conventional nucleotides. The enzymes of the 
invention are useful in many applications calling for the detectable labeling of nucleic acids and 
are particularly useful in DNA sequencing applications. 

BACKGROUND OF THE INVENTION 

Detectable labeling of nucleic acids is required for many applications in molecular 
biology, including applications for research as well as clinical diagnostic techniques. A 
commonly used method of labeling nucleic acids uses one or more unconventional nucleotides 
and a polymerase enzyme that catalyzes the template-dependent incorporation of the 
unconventional nucleotide(s) into the newly synthesized complementary strand. 

The ability of a DNA polymerase to incorporate the correct deoxynucleotide is the basis 
for high fidelity DNA replication in vivo. Amino acids within the active site of polymerases 
form a specific binding pocket that favors the placement of the correct complementary 
nucleotide opposite the template nucleotide. If a mismatched nucleotide, ribonucleotide, or 
nucleotide analog fills that position, the precise alignment of the amino acids contacting the 



incoming nucleotide may be distorted into a position unfavorable for DNA polymerization. 
Because of this, the unconventional nucleotides or nucleotide analogs used to label DNA tend to 
be incorporated into the elongated strand less efficiently than do the standard deoxynucleotide 
triphosphates (dNTPs; the so-called "standard" dNTPs include deoxyadenosine triphosphate 
(dATP), deoxycytosine triphosphate (dCTP), deoxyguanosine triphosphate (dGTP), and 
thymidine triphosphate (TTP)). 

The reduced efficiency with which unconventional nucleotides are incorporated by the 
polymerase increases the amount of the unconventional nucleotide necessary for DNA labeling. 
The reduced efficiency of incorporation of a particular nucleotide can also adversely affect the 
performance of techniques or assays, such as DNA sequencing, that depend upon unbiased 
incorporation of unconventional nucleotides for homogeneous signal strength. 

The identity and exact arrangement of the amino acids of a DNA polymerase that contact 
an incoming nucleotide triphosphate determine the nature of the nucleotides, both conventional 
and unconventional, that may be incorporated by that polymerase enzyme. Changes in the exact 
placement of the amino acids that contact the incoming nucleotide triphosphate at any stage of 
binding or chain elongation can dramatically alter the polymerase's capacity for utilization of 
unusual or unconventional nucleotides. Sometimes changes in distant amino acids can influence 
the incorporation of nucleotide analogs due to indirect global or structural effects. Polymerases 
with increased capacity to incorporate nucleotide analogs are useful for labeling DNA or RNA 
strands with nucleotides modified with signal moieties such as dyes, reactive groups or unstable 
isotopes. 

In addition to labeled nucleotides, an extremely important class of modified nucleotides 
is the dideoxynucleotides. The so-called "Sanger" or "dideoxy" DNA sequencing method 
(Sanger et al, 1977, Proc. Natl. Acad. Sci. USA 74: 5463, which is incorporated herein by 
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reference) relies upon the template-directed incorporation of nucleotides onto an annealed primer 
by a DNA polymerase from a mixture containing deoxy- and dideoxynucleotides. The 
incorporation of a dideoxynucleotide results in chain termination, the inability of the enzyme to 
catalyze further extension of that strand. Electrophoretic separation of reaction products results 
in a "ladder" of extension products wherein each extension product ends in a particular 
dideoxynucleotide complementary to the nucleotide opposite it in the template. The distance of 
the dideoxynucleotide analog from the primer is indicated by the length of the extension product. 
When four reactions, each containing one of the four dideoxynucleotide analogs ddA, ddC, ddG, 
or ddT (ddNTPs) are separated on the same gel, the sequence of the template may be read 
directly from the ladder patterns. Extension products may be detected in several ways, including 
for example, the inclusion of isotopically- or fluorescently-labeled primers, deoxynucleotide 
triphosphates or dideoxynucleotide triphosphates in the reaction. 

Fluorescent labeling has the advantages of faster data collection, since detection may be 
performed while the gel is running, and longer reads of sequence data from a single reaction and 
gel. Further, fluorescent sequence detection has allowed sequencing to be performed in a single 
reaction tube containing four differentially-labeled fluorescent dye terminators (the so-called 
dye-terminator method, Lee et al, 1992, Nucleic Acids Res. 20: 2471, incorporated herein by 
reference). 

A desirable quality of a polymerase useful for DNA sequencing is improved 
incorporation of dideoxynucleotides. Improved incorporation of dideoxynucleotides can make 
processes such as DNA sequencing more cost effective by reducing the requirement for 
expensive radioactive or fluorescent dye-labeled dideoxynucleotides. Moreover, unbiased 
dideoxynucleotide incorporation provides improved signal uniformity, leading to increased 
accuracy of base determination. The even signal output further allows subtle sequence 



differences caused by factors like allelic variation to be detected. Allelic variation, which 
produces two different half strength signals at the position of relevance, can easily be concealed 
by the varied signal strengths caused by polymerases with non-uniform ddNTP utilization. 

Incorporation of ribonucleotides by the native form of DNA polymerase is a rare event. 
Mutants that incorporate higher levels of ribonucleotides can be used for application s such as 
sequencing by partial ribosubstitution. In this system, a mixture of ribonucleotides and 
deoxynucleotides corresponding to the same base are incorporated by the mutant polymerase 
(Barnes, 1978 J. Mol. BioL 1 19:83-99). When the ribosequencing reactions are exposed to 
alkaline conditions and heat, fragmentation of the extended strand occurs. If the reactions for all 
four bases are separated on a denaturing acrylamide gel, they produce a sequencing ladder, there 
is a need in the art for polymerase mutants with higher utilization of ribonucleotides for this 
alternative method of sequencing. 

Alternatively, the incorporation of ribonucleotides followed by alkaline hydrolysis could 
be utilized in a system that requires random cleavage of DNA molecules such as DNA shuffling 
((Stemmer, 1994, Nature, 370: 389-391) which has also been called molecular breeding, sexual 
PCR and directed evolution). 

Another desirable quality in a DNA labeling enzyme is thermal stability. DNA 
polymerases exhibiting thermal stability have revolutionized many aspects of molecular biology 
and clinical diagnostics since the development of the polymerase chain reaction (PCR), which 
uses cycles of thermal denaturation, primer annealing, and enzymatic primer extension to 
amplify DNA templates. The prototype thermostable DNA polymerase is Taq polymerase, 
originally isolated from the thermophilic eubacterium Thermus aquaticus. So-called "cycle 
sequencing" reactions using thermostable DNA polymerases have the advantage of requiring 



smaller amounts of starting template relative to conventional (i.e., non-cycle) sequencing 
reactions. 

There are three major families of DNA polymerases, termed families A, B and C. The 
classification of a polymerase into one of these three families is based on structural similarity of 
a given polymerase to E. coli DNA polymerase I (Family A), II (Family B) or III (family C). As 
examples, Family A DNA polymerases include, but are not limited to Klenow DNA polymerase, 
Thermus aquaticus DNA polymerase I (Taq polymerase) and bacteriophage T7 DNA 
polymerase; Family B DNA polymerases, formerly known as a-family polymerases (Braithwaite 
and Ito, 1991, Nuc. Acids Res. 19:4045), include, but are not limited to human a,5 and e DNA 
polymerases, T4, RB69 and cp29 bacteriophage DNA polymerases, and Pyrococcus furiosus 
DNA polymerase (Pfu polymerase); and family C DNA polymerases include, but are not limited 
to Bacillus subtilis DNA polymerase III, and E. coli DNA polymerase III a and e subunits 
(listed as products of the dnaE and dnaQ genes, respectively, by Brathwaite and Ito, 1993, 
Nucleic Acids Res. 21: 787). An alignment of DNA polymerase protein sequences of each 
family across a broad spectrum of archaeal, bacterial, viral and eukaryotic organisms is presented 
in Braithwaite and Ito (1993, supra), which is incorporated herein by reference. 

The term used to describe the tendency of DNA polymerases to not to carry the 
incorporation of unnatural nucleotides into the nascent DNA polymer is "discrimination". In 
Family A DNA polymerases, the effective discrimination against incorporation of 
dideoxynucleotide analogs is largely associated with a single amino acid residue. The majority 
of enzymes from the Family A DNA polymerases have a phenylalanine (phe or F) residue at the 
position equivalent to F762 in E. coli Klenow fragment of DNA polymerase and demonstrate a 
strong discrimination against dideoxynucleotides. A few polymerases (e.g. T7 DNA 



polymerase) have a tyrosine (tyr or Y) residue at the corresponding position and exhibit 
relatively weak discrimination against dideoxynucleotides. Family A polymerases with tyrosine 
at this position readily incorporate dideoxynucleotides at levels equal to or only slightly different 
from the levels at which they incorporate deoxynucleotides. Conversion of the tyrosine or 
phenylalanine residues in the site responsible for discrimination reverses the dideoxynucleotide 
discrimination profile of the Family A enzymes (Tabor and Richardson, 1995, Proc. Natl. Acad. 
Sci. USA 92:6449). 

Among the thermostable DNA polymerases, a mutant form of the Family A DNA 
polymerase from Thermus aqaticus, known as AmpliTaq FS® (Perkin Elmer), contains a F667Y 
mutation at the position equivalent to F762 of Klenow DNA polymerase and exhibits increased 
dideoxynucleotide uptake (i.e., reduced discrimination against ddNTPs) relative to the wild-type 
enzyme. The reduced discrimination for dideoxynucleotide uptake makes it more useful for 
fluorescent and labeled dideoxynucleotide sequencing than the wild-type enzyme. 

The F667Y mutant of Taq DNA polymerase is not suited, however, for use with 
fluorescein-labeled dideoxynucleotides, necessitating the use of rhodamine dye terminators. 
Rhodamine dye terminators that are currently utilized with Taq sequencing reactions, however, 
stabilize DNA secondary structure, causing compression of signal. Efforts to eliminate 
compression problems have resulted in systems that use high amounts of the nucleotide analog 
deoxyinosine triphosphate (dITP) in place of deoxyguanosine triphosphate. While incorporation 
of (dITP) reduces the compression of the signal, the presence of dITP in the reaction produces 
additional complications including lowered reaction temperatures and increased reaction times. 
Additionally, the use of rhodamine dyes in sequencing requires undesirable post-reaction 
purification (Brandis, 1999 Nuc. Acid Res. 27:1912). 



Family B DNA polymerases exhibit substantially different structure compared to Family 
A DNA polymerases, with the exception of the position of acidic residues involved in catalysis 
in the so-called palm domain (Wang et al., 1997, Cell 89:1087; Hopfner et al., 1999, Proc. Natl. 
Acad. Sci. USA 96:3600). The unique structure of Family B DNA polymerases may permit a 
completely different spectrum of interactions with nucleotide analogs, perhaps allowing 
utilization of analogs which are unsuitable for use with Family A DNA polymerases due to 
structural constraints. Thermostable Family B DNA polymerases have been identified in 
hyperthermophilic archaea. These organisms grow at temperatures higher than 90 1C and their 
enzymes demonstrate greater themostability (Mathur et al., 1992, Stratagies 5:11) than the 
thermophilic eubacterial Family A DNA polymerases. Family B polymerases from 
hyperthermophilic archaea may be well suited starting substrates for modification(s) to reduce 
discrimination against non-conventional nucleotides. 

Although the crystal structures of three Family B DNA polymerases have been solved 
(Wang et al., 1997, supra; Hopfher, K.-R et al, 1999, Proc. Natl. Acad. Sci. 96: 3600; Zhao, 
1999, Structure Fold Des., 7:1 189), the structures of DNA-polymerase or dNTP-polymerase co- 
complexes have not yet been reported. At present, identification of amino acid residues 
contributing to nucleotide analog discrimination can only be inferred from extrapolation to 
Family A-dNTP structures or from mutagenesis studies carried out with related Family B DNA 
polymerases (e.g., human pola, phage T4, phage <p29, T. litoralis DNA polymerase). 

Sequence comparison of the Family B DNA polymerases indicate six conserved regions 
numbered I- VI (Braithwaite and Ito, 1993, supra). The crystal structure of bacteriophage RB69 
DNA polymerase (Family B) proposed by Wang et al. (Wang et al, 1997, supra) shows that 
Y41 6 in region II ( which corresponds to Y409 in the Family B DNA polymerase of 
Thermococcus species JDF-3) has the same position as Yl 15 in HIV reverse transcriptase (RT) 



and E710 in the Klenow fragment (Family A polymerases). Modeling of the dNTP and primer 
template complex in RB69 was carried out using the atomic coordinates of the reverse 
transcriptase-DNA cocrystal. This model predicts the RB69 Y416 packs under the deoxyribose 
portion of the dNTP. Tyrosine at this position has been implicated in ribose selectivity, 
contributing to polymerase discrimination between ribonucleotides and deoxribonucleotides in 
mammalian reverse transcriptases (Yl 15) (Gao et al., 1997, Proc. Natl. Acad. Sci. USA 94:407; 
Joyce, 1994, Proc. Natl. Acad. Sci. USA 94:1619) and in Family A DNA polymerases where 
modification of the corresponding invariable glutamate residue (E710) reduces discrimination 
against ribonucleotides (Gelfand et al, 1998, Pat. No. EP0823479; Astatke et al, 1998, Proc. 
5 Natl. Acad. Sci. USA 96:3402). 

3 Mutagenesis studies done in Family B DNA polymerases also implicate the region 

J containing the analogous Y in region II in dNTP incorporation and ribose selectivity. Mutations 

at the corresponding Y865 in human DNA polymerase a affect polymerase fidelity and 
Z_ sensitivity to dNTP nucleotide inhibitors such as AZT-TP, which has a bulky 3'-azido group in 
] place of the 3 5 -OH group, BuPdGTP, which contains a butylphenyl group attached to the amino 
J group at the C-2 position in the guanine base of dGTP (resulting in a bulkier and more 
hydrophobic purine base nucleotide) and aphidicolin, a competitive inhibitor of pyrimidine 
deoxynucleotide triphosphate. Interestingly, the mutants showed no difference in their uptake of 
ddCTP (Dong et al., 1993, J. Biol. Chem. 268: 26143). Additionally, mutants of bacteriophage 
T4 DNA polymerase, which have converted L412 to methionine (M) or isoleucine (I) just one 
amino acid before the analogous Y (Y41 1), show extreme and mild sensitivity, respectively, to 
the inorganic pyrophosphate analog phosphonoacetic acid (PAA). Alterations in PAA sensitivity 
have been shown to predict polymerase interactions with nucleotide analogs. L412 in T4 DNA 
polymerase corresponds to L410 in Thermococcus species JDF-3 DNA polymerase. The L412M 

-8- 



T4 DNA polymerase mutant was inhibited with 50-fold less ddGTP than wild-type polymerase 
while the K m s for dGTP was similar. As stated by the authors in that study, "[d]espite the 
sensitivity of the L412M DNA polymerase to ddGTP, there was no difference found in the 
incorporation of ddNTPs by wild-type and L412M DNA polymerase." (Reha-Krantz et al., 1993, 
J. Virol. 67:60). In bacteriophage cp29, mutations in region II (LYP where Y is analogous to 
Thermococcus species JDF3 DNA polymerase Y409) produce mixed results when challenged 
with PAA; P255S was hypersensitive to PAA while L253 V was shown to be less sensitive than 
the wild-type enzyme (Blasco et al, 1993, J. Biol. Chem. 268: 24106). These data support the 
role of the LYP region (region II) in polymerase-nucleotide interactions, but improved 
incorporation of ddNTPs was not achieved in these references. 

In another study, extensive mutation of region II in the archaeal Family B DNA 
polymerase from Thermococcus litoralis DNA polymerase (VENT™ polymerase, New England 
Biolabs) was performed. In that study, 26 different site-directed mutants were made for the sole 
intent of examining nucleotide analog discrimination (Gardner and Jack, 1999, Nucleic Acids 
Res. 27: 2545). Site-directed mutagenesis of VENT™ DNA polymerase demonstrated that three 
mutations at Y412 (which corresponds to JDF-3 DNA polymerase Y409) could alter nucleotide 
binding (Gardner and Jack, 1999, supra). Y412V was most significant with a 2 fold increase in 
dideoxynucleotide incorporation and a 200 fold increase in the incorporation of ribonucleotide 
ATP. The mutation Y412F showed no change in analog incorporation. 

Region III of the Family B polymerases (also referred to as motif B) has also been 
demonstrated to play a role in nucleotide recognition. This region, which corresponds to AA 487 
to 495 of JDF-3 Family B DNA polymerase, has a consensus sequence KX 3 NSXYG (Jung et al, 
1990, supra; Blasco et al, 1992, supra; Dong et al., 1993, J. Biol. Chem. 268:21163; Zhu et al., 
1994, Biochem. Biophys. Acta 1219:260; Dong and Wang, 1995, J. Biol. Chem. 270:21563), and 
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is functionally, but not structurally (Wang et al, 1997, supra), analogous to KX 3 (F/Y)GX 2 YG in 
helix O of the Family A DNA polymerases. In Family A DNA polymerases, such as the Klenow 
fragment and Taq DNA polymerases, the O helix contains amino acids that play a major role in 
dNTP binding (Astatke et al., 1998, J. Mol. Biol. 278:147; Astatke et ah, 1995, J. Biol Chem. 
270:1945; Polesky et al, 1992, J. Biol. Chem 267:8417; Polesky et al., 1990, J. Biol Chem. 
265:14579; Pandey et al, 1994, J. Biol Chem. 269:13259; Kaushik et al, 1996, Biochem. 
35:7256). Specifically, helix O contains the F (F763 in the Klenow fragment; F667 in Taq) 
which confers ddNTP discrimination in Family A DNA polymerases (KX3(F/Y)GX 2 YG) (Tabor 
and Richardson, 1995, supra). 

Directed mutagenesis studies in region III of VENT™ DNA polymerase also targeted an 
alanine analogous to A485 of the Thermococcus species JDF-3 DNA polymerase (Jung et al., 
1990, supra). These mutants (A~>C, A^S, A-*L, A ->I, A~>F and A->V) exhibited a range of 
specific activities from 0.12 to 1.2 times the polymerase activity of the progenitor enzyme 
(Gardner and Jack, 1999, Nucl. Acids Res. 27:2545). The dideoxynucleotide incorporation 
ranged from 4 to 15 times the unmutated enzyme. Interestingly, the mutant with the highest 
dideoxynucleotide incorporation (15x) had a specific activity of only 0.12x of the original 
enzyme. 

Site-directed mutagenesis studies on the Family B DNA polymerase from Thermococcus 
barossii modified each residue independently in the sequence ILANSF, which corresponds to 
AA residues 488-493 of the JDF-3 DNA polymerase, to tyrosine (Reidl et al., U.S. Patent No. 
5,882,904). That study indicated that an L489Y mutant exhibits approximately 3 times greater 
incorporation of dideoxynucleotides relative to an enzyme bearing the wild-type leucine residue 
at this site. 
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One area of active research involves the use of nucleic acid arrays, often referred to as 
nucleic acid or DNA "chips", in the simultaneous analyses of multiple different nucleic acid 
sequences. Many of these applications, such as those described in U.S. Patent No. 5,882,904 
(Reidl et al., issued March 16, 1999) will benefit from DNA polymerases exhibiting reduced 
discrimination against non-conventional nucleotides, particularly fluorescently-labeled non- 
conventional nucleotides. Applications being addressed in the chip format include DNA 
sequencing and mutation detection, among others. For example, the "mini-sequencing" methods 
(e.g., Pastinen et al, 1997, Genome Res. 7: 606; Syvanen, 1999, Human Mutation 13: 1-10) and 
the arrayed primer extension (APEX) mutation detection method (Shumaker et al, 1996, Hum. 
Mutat. 7: 346) and methods like them can benefit from DNA polymerases with reduced 
3 discrimination against fluorescently-labeled or other non-conventional nucleotides. There is a 
|:j need in the art for a non-discriminating DNA polymerase for use in chip or gel based mini- 
sequencing systems. Such a system would advantageously permit detection of multiplexed 

2 single nucleotide polymorphisms (SNPs) and allow for quantitative genotyping. Identification of 
; j sequence variation permits the diagnosis and treatment of genetic disorders, predisposition to 

3 multifactorial diseases, and sensitivity to new or existing pharmaceutical products. 

There is a need in the art for DNA polymerases with reduced discrimination against 
unconventional nucleotides. There is particularly a need in the art for thermostable DNA 
polymerases exhibiting reduced discrimination against dideoxynucleotides, and further, for DNA 
polymerases exhibiting reduced discrimination against fluorescently labeled dideoxynucleotides. 

SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods utilizing DNA polymerase 
enzymes exhibiting reduced discrimination against non-conventional nucleotides. Enzymes with 



this quality are useful in many applications calling for the detectable labeling of nucleic acids 
and are particularly useful in DNA sequencing applications. 

The invention further relates to a Family B DNA polymerase having one or more 
mutations at a site or sites corresponding to L408, P410, S345, and/or A485 of SEQ ID NO: 2, or 
a fragment thereof which retains the ability to direct the template-dependent polymerization of 
nucleic acid. The invention also encompasses mutants and modified versions (e.g., reversibly 
inactivated versions of a Family B polymerase prepared, for example, by chemical modification 
or antibody complexing) of a Family B polymerase mutated at sites corresponding to L408, P410 
and or A485 of SEQ ID NO: 2. 

In one embodiment, the DNA polymerase has a dual mutation comprising comprising a 
serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 2; and a proline to 
leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

The invention encompasses purified thermostable DNA polymerase having an amino acid 
sequence presented in SEQ ID NO: 2 from residue 1 to 776. 

In one embodiment, the thermostable DNA polymerase is isolated from Thermococcus 
species JDF-3. 

In another embodiment, the thermostable polymerase is isolated from a recombinant 
organism transformed with a vector that codes for the expression of Thermococcus species JDF- 
3 DNA polymerase. 

The invention further encompasses a recombinant vector comprising the nucleotide 
sequence presented in SEQ ID NO: 1. 

The invention further encompasses an isolated recombinant polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2 or a functional fragment thereof. 
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The invention further encompasses an isolated recombinant DNA polymerase from 
Thermococcus species JDF-3 that is 3' to 5' exonuclease deficient. 

In one embodiment, the isolated recombinant DNA polymerase of has an aspartic acid to 
threonine or alanine mutation at the amino acid corresponding to D141 of SEQ ID NO: 2 or a 
glutamic acid to alanine mutation at the amino acid corresponding to El 43 of SEQ ID NO: 2. 

In another embodiment, the isolated recombinant DNA polymerase has an aspartic acid 
to threonine or alanine mutation at the amino acid corresponding to D 141 of SEQ ID NO: 2 and 
a glutamic acid to alanine mutation at the amino acid corresponding to E143 of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant DNA polymerase having 
reduced discrimination against non-conventional nucleotides. 

In one embodiment, the DNA polymerase is a Family B DNA polymerase. 

In another embodiment, the DNA polymerase further comprises a mutation selected from 
the group consisting of: a leucine to histidine mutation at a site corresponding to L408 of SEQ 
ID NO: 2; a leucine to phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2; 
a proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2; and an alanine to 
threonine mutation at a site corresponding to A485 of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant DNA polymerase having the 
alanine to threonine mutation at the site corresponding to A485 of SEQ ID NO:2 further 
comprising a mutation selected from the group consisting of: a leucine to histidine mutation at a 
site corresponding to L408 of SEQ ID NO: 2; a leucine to phenylalanine mutation at a site 
corresponding to L408 of SEQ ID NO: 2; and a proline to leucine mutation at a site 
corresponding to P410 of SEQ ID NO: 2, 
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The invention further encompasses an isolated recombinant DNA polymerase having the 
a proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2, further 
comprising of serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 2 

In another embodiment, the DNA polymerase has reduced discrimination against a non- 
conventional nucleotide selected from the group consisting of: dideoxynucleotides, 
ribonucleotides and conjugated nucleotides. 

In another embodiment, conjugated nucleotide is selected from the group consisting of 
radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled nucleotides, 
chemiluminescently labeled nucleotides and quantum dot labeled nucleotides. 

The invention further encompasses an isolated recombinant Family B DNA polymerase 
comprising an alanine to threonine mutation at the site corresponding to A485 of SEQ ID NO: 2 
or a mutation at a site corresponding to L408 or P410 of SEQ ID NO: 2, wherein the DNA 
polymerase has reduced discrimination against non-conventional nucleotides relative to the wild- 
type form of that polymerase. 

In one embodiment, the Family B DNA polymerase is 3' to 5' exonuclease deficient. 

In another embodiment, the Family B DNA polymerase has a mutation at an amino acid 
corresponding to D141 or E143 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase has an aspartic acid to threonine 
or alanine mutation at a site corresponding to D141 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase has a glutamic acid to alanine 
mutation at a site corresponding to E143 of SEQ ID NO: 2. 
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In another embodiment, the Family B DNA polymerase has a glutamic acid to alanine 
mutation at a site corresponding to El 43 of SEQ ID NO: 2 and has an aspartic acid to threonine 
or alanine mutation at the amino acid corresponding to D141 of SEQ ID NO: 2. 
In another embodiment, the Family B DNA polymerase is thermostable. 
In another embodiment, the Family B DNA polymerase is archaeal. 
In another embodiment, the Family B DNA polymerase comprises a leucine to histidine 
mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a leucine to 
phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 
3 In another embodiment, the Family B DNA polymerase comprises a proline to leucine 

f mutation at a site corresponding to P41 0 of SEQ ID NO: 2. 

i In another embodiment, the Family B DNA polymerase comprises an alanine to 

threonine mutation at a site corresponding to A485 of SEQ ID NO: 2. 
;! In another embodiment, the Family B DNA polymerase comprising an alanine to 

j threonine mutation at a site corresponding to A485 of SEQ ID NO: 2 comprises a leucine to 
3 histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprising an alanine to 

threonine mutation at a site corresponding to A485 of SEQ ID NO: 2 comprises a leucine to 

phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprising an alanine to 

threonine mutation at a site corresponding to A485 of SEQ ID NO: 2 comprises a proline to 

leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 
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In another embodiment, the Family B DNA polymerase comprising a proline to leucine 
mutation at a site corresponding to P410 of SEQ ID NO: 2, further having a serine to proline 
mutation at a site corresponding to S345 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a serine to proline 
mutation at a site corresponding to S345 of SEQ ID NO: 2, and may further comprise a mutation 
at a site corresponding to T604 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a tyrosine to cysteine 
mutation at a site corresponding to Y497 of SEQ ID NO: 2, and may further comprise an 
isoleucine to valine mutation at a site corresponding to 1630 of SEQ ID NO:2. 
3 In another embodiment, the Family B DNA polymerase comprises a glutamic acid to 

3 lysine mutation at a site corresponding to E645 of SEQ ID NO: 2. 

Z In another embodiment, the Family B DNA polymerase comprises a glutamic acid to 

lysine mutation at a site corresponding to E578 of SEQ ID NO: 2, and may further comprise an 
*i arginine to methionine mutation at a site corresponding to R465 of SEQ ID NO: 2. 
j In another embodiment, the Family B DNA polymerase comprises a leucine to glutamine 

3 mutation at a site corresponding to L396 of SEQ ID NO: 2, and may further comprise a mutation 

at a site corresponding to V401, N424, P569, E617, or V640 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a serine to asparagene 

mutation at a site corresponding to S651 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a leucine to proline 

mutation at a site corresponding to L396 of SEQ ID NO: 2, and may further comprise a mutation 

at a site corresponding to E459 of SEQ ID NO: 2. 
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In another embodiment, the Family B DNA polymerase comprises a leucine to proline 
mutation at a site corresponding to L456 of SEQ ID NO: 2, and may further comprise a mutation 
at a site corresponding to E658 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprises a leucine to histidine 
mutation at a site corresponding to L408 of SEQ ID NO: 2, and may further comprise a mutation 
at a site corresponding to V437, or L478 of SEQ ID NO: 2. The L408H mutation was isolated 
both in the dideoxynucleotide and the dye-dideoxynucleotide screens described herein. 

In another embodiment, the Family B DNA polymerase comprises an tyrosine to 
asparagine mutation at a site corresponding to Y496 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase has reduced discrimination 
against a non-conventional nucleotide selected from the group consisting of: dideoxynucleotides, 
ribonucleotides and conjugated nucleotides. 

In another embodiment, the conjugated nucleotide is selected from the group consisting 
of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled nucleotides, 
chemiluminescently labeled nucleotides and quantum dot labeled nucleotides. 

In another embodiment, an isolated recombinant DNA polymerase having reduced 
discrimination against non-conventional nucleotides or an isolated recombinant Family B DNA 
polymerase comprising an alanine to threonine mutation at the site corresponding to A485 of 
SEQ ID NO: 2 or a mutation at a site corresponding to L408 or P410 of SEQ ID NO: 2, wherein 
the DNA polymerase has reduced discrimination against non-conventional nucleotides relative to 
the wild-type form of that polymerase further comprises a mutation at an amino acid residue in 
the polymerase that corresponds to a mutation selected from the group consisting of: a Y to V 
mutation at amino acid 409 of SEQ ID NO:2; an A to C, S, L, I, F, or V mutation at amino acid 
485 of SEQ ID NO: 2; a Y to S mutation at amino acid 494 of SEQ ID NO: 2; a Y to L mutation 
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at amino acid 496 of SEQ ID NO: 2; and an A to Y mutation at amino acid 490 of SEQ ID NO: 
2. 

In another embodiment, an isolated recombinant DNA polymerase having reduced 
discrimination against non-conventional nucleotides or an isolated recombinant Family B DNA 
polymerase comprising an alanine to threonine mutation at the site corresponding to A485 of 
SEQ ID NO: 2 or a mutation at a site corresponding to L408 or P410 of SEQ ID NO: 2, wherein 
the DNA polymerase has reduced discrimination against non-conventional nucleotides relative to 
the wild-type form of that polymerase further comprises a mutation at an amino acid of the 
polymerase corresponding to one of amino acids 483 to 496, inclusive, of SEQ ID NO: 2. 

In one embodiment, the mutation is at an amino acid of the polymerase corresponding to 
one of amino acids 485, 490, 494, or 496 of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant Family B DNA polymerase 
comprising an alanine to threonine mutation at an amino acid corresponding to A485T of SEQ 
ID NO: 2 and at least one substitution in the polymerase of an amino acid corresponding to 
L408, Y409, or P410, respectively, of SEQ ID NO: 2. 

The invention further encompasses an isolated recombinant Family B DNA polymerase 
comprising an amino acid other than A at an amino acid of the polymerase corresponding to 
A485 of SEQ ID NO: 2, and at least one substitution in the polymerase of an amino acid 
corresponding to L408, Y409, or P410, respectively, of SEQ ID NO: 2. 

The invention further encompasses a recombinant vector comprising a nucleic acid 
sequence encoding the Family B DNA polymerase. 

The invention further encompasses a method of labeling a complementary strand of 
DNA, the method comprising the step of contacting a template DNA molecule with a 
recombinant Family B DNA polymerase from Thermococcus species JDF-3, wherein the DNA 
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polymerase has reduced discrimination against non-conventional nucleotides, and a non- 
conventional nucleotide, under conditions and for a time sufficient to permit the DNA 
polymerase to synthesize a complementary DNA strand and to incorporate the non-conventional 
nucleotide into the synthesized complementary DNA strand. 

The invention further encompasses a method of labeling a complementary strand of 
DNA, the method comprising the step of contacting a template DNA molecule with a 
recombinant Family B DNA polymerase comprising an alanine to threonine mutation at a site 
corresponding to A485 of SEQ ID NO: 2 or a mutation at a site corresponding to L408 or P410 
of SEQ ID NO: 2, wherein the DNA polymerase has reduced discrimination against non- 
conventional nucleotides, and a non-conventional nucleotide, under conditions and for a time 
sufficient to permit the DNA polymerase to synthesize a complementary DNA strand and to 
incorporate the non-conventional nucleotide into the synthesized complementary DNA strand. 

In one embodiment, the recombinant Family B DNA polymerase is 3' to 5' exonuclease 
deficient. 

In another embodiment, the recombinant Family B polymerase comprises a leucine to 
histidine mutation at a site corresponding to amino acid L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase comprises a leucine to 
phenylalanine mutation at a site corresponding to amino acid L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase comprises a proline to 
leucine mutation at a site corresponding to amino acid P410 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase comprises an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2. 
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In another embodiment, the recombinant Family B polymerase comprising an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 comprises a 
leucine to histidine mutation at an amino acid corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase comprising an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 comprises a 
leucine to phenylalanine mutation at an amino acid corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase comprising an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 comprises a 
proline to leucine mutation at an amino acid corresponding to P410 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase has reduced 
discrimination against a non-conventional nucleotide selected from the group consisting of: 
dideoxynucleotides, ribonucleotides, and conjugated nucleotides. 

In another embodiment, the conjugated nucleotide is selected from the group consisting 
of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled nucleotides, 
chemiluminescently labeled nucleotides and quantum dot labeled nucleotides. 

The invention further encompasses a method of sequencing DNA comprising the steps of 
contacting a DNA strand to be sequenced with a sequencing primer, a recombinant Family B 
DNA polymerase from Thermococcus species JDF-3, wherein the DNA polymerase has reduced 
discrimination against non-conventional nucleotides, and a chain-terminating nucleotide analog, 
under conditions that permit the DNA polymerase to synthesize a complementary DNA strand, 
and to incorporate nucleotides into the synthesized complementary DNA strand, wherein 
incorporation of a chain-terminating nucleotide analog results in the termination of chain 
elongation, such that the nucleotide sequence of the template DNA strand is determined. 
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The invention further encompasses a method of sequencing DNA comprising the steps of 
contacting a DNA strand to be sequenced with a sequencing primer, a recombinant Family B 
DNA polymerase comprising an alanine to threonine mutation at a site corresponding to A485 of 
SEQ ID NO: 2 or a mutation at a site corresponding to L408, S345 or P410 of SEQ ID NO: 2, 
where the DNA polymerase has reduced discrimination against non-conventional nucleotides, 
and a chain-terminating nucleotide analog, under conditions that permit the DNA polymerase to 
synthesize a complementary DNA strand, and to incorporate nucleotides into the synthesized 
complementary DNA strand, wherein incorporation of a chain-terminating nucleotide analog 
results in the termination of chain elongation, such that the nucleotide sequence of the template 
DNA strand is determined. 

In one embodiment, the recombinant DNA polymerase is deficient in 3 ' to 5 5 
exonuclease activity. 

In another embodiment, the recombinant Family B polymerase has a leucine to histidine 
mutation at a site corresponding to amino acid L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase has a leucine to 
phenylalanine mutation at a site corresponding to amino acid L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase has a proline to leucine 
mutation at a site corresponding to amino acid P410 of SEQ ID NO: 2. 

In another embodiment, the Family B DNA polymerase comprising a proline to leucine 
mutation at a site corresponding to P410 of SEQ ID NO: 2, further having a serine to proline 
mutation at a site corresponding to S345 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase has an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2. 
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In another embodiment, the recombinant Family B polymerase having an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 has a leucine to 
histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase having an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 has a leucine to 
phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

In another embodiment, the recombinant Family B polymerase having an alanine to 
threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 2 has a proline to 
leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

In another embodiment, the chain-terminating nucleotide analog is a dideoxynucleotide. 

In another embodiment, the dideoxynucleotide is detectably labeled. 

In another embodiment, the dideoxynucleotide is fluorescently labeled. 

In another embodiment, the dideoxynucleotide is labeled with a moiety selected from the 
group consisting of fluorescein and rhodamine. 

The invention also encompasses a kit for performing the methods disclosed herein. 

The invention also encompasses methods of making a recombinant DNA polymerase as 
disclosed here, comprising culturing a host cell containing a nucleic acid sequence encoding said 
polymerase under conditions which permit production of said DNA polymerase. 

The invention encompasses a mixture of a mutant DNA polymerase described herein and 
another DNA polymerase such as Taq DNA polymerase (preferably the mutant form, F667Y). 
Such a mixture is usefil in that it may increase signal uniformity generated from polymerization 
of a labeled nucleotide into a synthetic nucleotide. 

As used herein, "discrimination" refers to the tendency of DNA polymerase to not 
incorporate non-conventional nucleotides into a nascent DNA polymer. DNA polymerase has 
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the ability to sense nucleotide structure, including but not limited to nucleotide base 
complementarity, and structural features of the sugar and heterocyclic base, thereby allowing 
DNA polymerase to preferentially utilize conventional deoxynucleotides rather than non- 
conventional nucleotides for incorporation into a nascent polymer. DNA polymerase strongly 
prefers to incorporate the conventional deoxynucleotides dATP, dCTP, dGTP and TTP into 
DNA polymers; the polymerase is unlikely to progress with an unconventional nucleotide in its 
binding pocket. 

As used herein, "reduced discrimination" refers to a reduction of at least 50% in the 
tendency of a DNA polymerase to exclude a non-conventional nucleotide from (that is, to not 

3 incorporate non-conventional nucleotides into) a nascent DNA polymer, relative to a parental or 
wild type DNA polymerase which does not exhibit reduced discrimination. The preference of 

; DNA polymerase to incorporate the conventional deoxynucleotides dATP, dCTP, dGTP and 
TTP rather than non-conventional nucleotides into DNA polymers is thereby reduced compared 

i to the natural level of preference, such that non-conventional nucleotides are more readily 
incorporated into DNA polymers by DNA polymerase. According to the invention, a 

3 polymerase exhibiting reduced discrimination will exhibit reduced discrimination against at least 
one non-conventional nucleotides, but may not exhibit reduced discrimination against all non- 
conventional nucleotides. 

According to the invention, discrimination is quantitated by measuring the concentration 
of a non-conventional nucleotide required to inhibit the incorporation of the corresponding 
conventional nucleotide by 50%. This concentration is referred to herein as the "150%" for a non- 
conventional nucleotide. Discrimination against a given non-conventional nucleotide is 
"reduced" if the 150% for that non-conventional nucleotide is reduced by at least two fold (50%) 
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relative to an identical assay containing, in place of the mutant DNA polymerase, a parental 
DNA polymerase. 

Alternatively, reduced discrimination may be quantitated by determining the amount of a 
non-conventional nucleotide (for example, a dideoxynucleotide, ribonucleotide, or cordycepin) 
required in a reaction with a mutant polymerase having reduced discrimination to generate a 
sequencing ladder identical to a sequencing ladder produced using the wild-type or parental 
enzyme. The sequencing ladder can be examined, for example, in the range of 1 to 400 bases 
from the primer terminus, and the ladders will be identical in the number of extension products 
generated as well as the lengths of extension products generated in the sequencing reaction. For 
this type of assay, a constant amount of dNTPs and varying amounts of non-conventional 
nucleotides are used to generate a sequencing ladder with both the wild-type (or parental) 
enzyme and the mutant poplymerase (for ribonucleotides, a sequencing ladder is generated by 
alkali cleavage of the polymerization products). See Gardner & Jack, 1999, supra. A mutant 
exhibits reduced discrimination if it requires at least two-fold (50%) less, five-fold (80%)less, 
ten-fold (100%) less, etc. of the amount of the non-conventional nucleotide used by the wild-type 
or parental polymerase to produce a sequencing ladder identical (with respect to the number and 
length of extension products generated) to that generated by the wild-type or parental enzyme. 

As used herein, the term "parental" or "progenitor" refers to a polymerase used as the 
starting material in generating a mutant polymerase having reduced discrimination. The term 
"parental" is meant to encompass not only a so-called "wild-type" enzyme as it occurs in nature, 
but also intermediate forms, for example, an exonuclease deficient enzyme that is used as the 
starting material for generating an enzyme with reduced discrimination against non-conventional 
nucleotides. 
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As used herein, "non-conventional nucleotide" refers to a) a nucleotide structure that is 
not one of the four conventional deoxynucleotides dATP, dCTP, dGTP, and TTP recognized by 
and incorporated by a DNA polymerase, b) a synthetic nucleotide that is not one of the four 
conventional deoxynucleotides in (a), c) a modified conventional nucleotide, or d) a 
ribonucleotide (since they are not normally recognized or incorporated by DNA polymerases) 
and modified forms of a ribonucleotide. Non-conventional nucleotides include but are not 
limited to those listed in Table III, which are commercially available, for example, from New 
England Nuclear. Any one of the above non-conventional nucleotides may be a "conjugated 
nucleotide", which as used herein refers to nucleotides bearing a detectable label, including but 
not limited to a fluorescent label, isotope, chemiluminescent label, quantum dot label, antigen, or 
affinity moiety. 

As used herein, the term "cell", "cell line" and "cell culture" can be used interchangeably 
and all such designations include progeny. Thus, the words "transformants" or "transformed 
cells" includes the primary subject cell and cultures derived therefrom without regard for the 
number of transfers. It is also understood that all progeny may not be precisely identical in DNA 
content, due to deliberate or inadvertent mutations. Mutant progeny that have the same 
functionality as screened for in the originally transformed cell are included. 

As used herein, the term "organism transformed with a vector" refers to an organism 
carrying a recombinant gene construct. 

As used herein, "thermostable" refers to a property of a DNA polymerase, such that the 
enzyme active at elevated temperatures and is resistant to DNA duplex-denaturing temperatures 
in the range of about 93°C to about 97 °C. "Active" means the enzyme retains the ability to 
effect primer extension reactions when subjected to elevated or denaturing temperatures for the 
time necessary to effect denaturation of double-stranded nucleic acids. Elevated temperatures as 
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used herein refer to the range of about 70°C to about 75 °C, whereas non-elevated temperatures 
as used herein refer to the range of about 35°C to about 50 °C. 

As used herein, "archaeal" refers to an organism or to a DNA polymerase from an 
organism of the kingdom Archaea. 

As used herein, "sequencing primer" refers to an oligonucleotide, whether natural or 
synthetic, which serves as a point of initiation of nucleic acid synthesis by a polymerase 
following annealing to a DNA strand to be sequenced. A primer is typically a single-stranded 
oligodeoxyribonucleotide. The appropriate length of a primer depends on the intended use of the 
primer, but for DNA sequencing applications typically ranges from about 15 to about 40 
nucleotides in length. 

As used herein, "Family B DNA polymerase" refers to any DNA polymerase that is 
classified as a member of the Family B DNA polymerases, where the Family B classification is 
based on structural similarity to E. coli DNA polymerase II. The Family B DNA polymerases, 
formerly known as a-family polymerases, include, but are not limited to those listed as such in 
Table I. 

As used herein, "Family A DNA polymerase" refers to any DNA polymerase that is 
classified as a member of the Family A DNA polymerases, where the Family A classification is 
based on structural similarity to E. coli DNA polymerase I. Family A DNA polymerases 
include, but are not limited to those listed as such in Table I. 

As used herein, "3' to 5' exonuclease deficient" or "3' to 5' exo" refers to an enzyme that 
substantially lacks the ability to remove incorporated nucleotides from the 3' end of a DNA 
polymer. DNA polymerase exonuclease activities, such as the 3' to 5' exonuclease activity 
exemplified by members of the Family B polymerases, can be lost through mutation, yielding an 
exonuclease-deficient polymerase. As used herein, a DNA polymerase that is deficient in 3' to 
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5' exonuclease activity substantially lacks 3' to 5 5 exonuclease activity. "Substantially lacks" 
encompasses a complete lack of activity, or a "substantial" lack of activity. "Substantial" lack of 
activity means that the 3' exonuclease activity of the mutant polymerase relative to the parental 
polymerase is 0.03%, and also may be 0.05% 5 0.1%, 1%, 5%, 10%, or 20%, but is not higher 
than 50% of the 3' exonuclease activity of the parental or wild type polymerase. 

As used herein, "mutation" refers to a change introduced into a starting parental DNA 
sequence that changes the amino acid sequence encoded by the DNA. The consequences of a 
mutation include but are not limited to the creation of a new character, property, function, or trait 
not found in the protein encoded by the parental DNA. 

As used herein, "wild-type" refers to the typical state of an organism, strain, gene, protein 
or characteristic as it occurs in nature. The wild-type is therefore the natural state that is 
distinguished from a mutant, which was derived from the wild type by introduction of change(s) 
to the wild-type. 

As used herein, "corresponding" refers to sequence similarity in a comparison of two or 
more nucleic acids or polypeptides, where functionally equivalent domains or sub-sequences are 
identified; such functionally equivalent domains or sub-sequences or amino acids within such a 
domain or sub-sequence are said to "correspond". That is, two or more sequences are compared 
through a comparative alignment analysis in which an entire sequence is examined for regions of 
sequence that are similar or identical, and thus regions likely to be functionally equivalent to 
regions from the other sequence(s) are identified. 

As used herein in reference to comparisons of an amino acid, amino acid sequence, or 
protein domain, the term "similar" refers to amino acids or domains that although not identical, 
represent "conservative" differences. By "conservative" is meant that the differing amino acid 
has like characteristics with the amino acid in the corresponding or reference sequence. Typical 
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conservative substitutions are among Ala, Val, Leu and He; among Ser and Thr; among the 
acidic residues Asp and Glu; among Asn and Gin; and among the basic residues Lys and Arg; or 
aromatic residues Phe and Tyr. In calculating the degree (most often as a percentage) of 
similarity between two polypeptide sequences, one considers the number of positions at which 
identity or similarity is observed between corresponding amino acid residues in the two 
polypeptide sequences in relation to the entire lengths of the two molecules being compared. 

As used herein, the term "functionally equivalent" means that a given motif, region, or 
amino acid within a motif or region performs the same function with regard to the overall 
function of the enzyme as a motif, region or amino acid within a motif or region performs in 
another enzyme. 

As used herein, "chain terminating nucleotide analog" refers to a nucleotide analog that 
once incorporated cannot serve as a substrate for subsequent extension by a DNA polymerase, 
thereby terminating the elongation of a DNA polymer by a DNA polymerase. Such a nucleotide 
analog typically lacks a hydroxyl group on its sugar moiety to which DNA polymerase can 
synthesize a phosphodiester bond with an incoming nucleotide. Chain terminating nucleotide 
analogs are a subset of non-conventional nucleotides, and include but are not limited to 
dideoxynucleotides. 

As used herein, "detectably labeled" refers to a structural modification that incorporates a 
functional group (label) that can be readily detected by various means. Compounds that can be 
detectably labeled include but are not limited to nucleotide analogs. Detectable nucleotide 
analog labels include but are not limited to fluorescent compounds, isotopic compounds, 
chemiluminescent compound, quantum dot labels, biotin, enzymes, electron-dense reagents, and 
haptens or proteins for which antisera or monoclonal antibodies are available. The various 
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means of detection include but are not limited to spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. 

As used herein in reference to a polynucleotide or polypeptide, the term "isolated" means 
that a naturally occurring sequence has been removed from its normal cellular environment or is 
synthesized in a non-natural environment (e.g., artificially synthesized). Thus, the sequence may 
be in a cell-free solution or placed in a different cellular environment. The term does not imply 
that the sequence is the only nucleotide or polypeptide chain present, but that it is essentially free 
(about 90-95% pure at least) of non-nucleotide or non-polypeptide material, respectively, 
naturally associated with it. 

As used herein, the term "recombinant" refers to a polynucleotide or polypeptide that is 
altered by genetic engineering (i.e., by modification or manipulation of the genetic material 
encoding that polynucleotide or polypeptide). 

The invention encompasses full length mutant DNA polymerases, as described herein, as 
well as a functional fragment of a mutant polymerase, that is, a fragment of a DNA polymerase 
that is less than the entire amino acid sequence of the mutant polymerase and retains the ability, 
under at least one set of conditions, to catalyze the polymerization of a polynucleotide. Such a 
functional fragment may exist as a separate entity, or it may be a constituent of a larger 
polypeptide, such as a fusion protein. 

As used herein, the term "complementary DNA strand" refers to that DNA molecule 
synthesized from a template DNA molecule by a DNA polymerase in a primer extension 
reaction. 

As used herein, the term "template DNA molecule" refers to that strand of a nucleic acid 
from which a complementary nucleic acid strand is synthesized by a DNA polymerase, for 
example, in a primer extension reaction. 
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Further features and advantages of the invention will become more fully apparent in the 
following description of the embodiments and drawings thereof, and from the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the DNA sequence encoding Thermococcus species JDF-3 DNA 
polymerase (intein removed) (SEQ ID NO: 1). 

Figure 2 shows the amino sequence of Thermococcus species JDF-3 DNA polymerase 
(intein removed) (SEQ ID NO: 2). 

Figure 3 shows the amino acid sequence of the genomic clone encoding Thermococcus 
species JDF-3 DNA polymerase (SEQ ID NO: 3). The position of an intein, removed by post- 
translational processing, is shown. 

Figure 4 shows the DNA sequence of the genomic clone encoding Thermococcus species 
JDF-3 DNA polymerase (SEQ ID NO: 4). DNA sequences are shown which correspond to 5' 
and V untranslated regions, polymerase-coding regions (exteins), and an intein-coding region. 

Figure 5 shows nucleotide incorporation by JDF-3 mutants. Lambda phage clones which 
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incorporated P-labeled ddNTPs in the primary library screen were rescreened to assess P- 
ddNTP incorporation in the presence of: (panel 1) 0.5mM MnCl 2 or (panel 2) L5mM MgCb. 
Polymerase activity was measured using 33 P-dNTPs in the presence of 1 .5mM MgCk (panel 3). 
Nucleotide utilization is shown for clones 1-18 and for the parental #550 clone. 

Figure 6 shows 33 P-ddNTP cycle sequencing reactions performed using JDF-3 
polymerase mutants. Purified JDF-3 mutants were substituted into the Thermo Sequenase 
radiolabeled terminator cycle sequencing kit. DNA sequencing ladders were generated as per the 
kit's instructions using the following polymerases: (A) Thermo Sequenase (B) JDF-3 #550 clone 
(parental) (C) JDF-3 A485T mutant (clone pl2) (D) JDF-3 P410L mutant (clone pi 1) (E) JDF-3 
P410L mutant (clone p8). The top of the original sequencing gel is shown on the side. The lanes 
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are: (bottom) ddGTP, ddATP, ddTTP, ddCTP (top). Clones p8, pi 1, and pl2 contain ancillary 
mutations and an amino-terminal tag. 

Figure 7 shows cycle sequencing reactions performed using dye-labeled ddNTPs and 
JDF-3 polymerase mutants. DNA sequencing ladders were generated using (1) 2.14|uM dNTP: 
0.0214pM ddNTP; (2) 2.14pM dNTP: 0.214^iM ddNTP; or (3) 2.14jiM dNTP: 2.14pM ddNTP. 
The following purified DNA polymerases were used: (A) JDF-3 #550 clone (parental) (B) 
Thermo Sequenase (C) JDF-3 P410L mutant (clone p8, contains ancillary mutations and an 
amino tag) (E) JDF-3 L408H mutant (clone 1-1). The top of the original sequencing gel is 
shown on the right hand side. 

IK. 

!? Figure 8 shows cycle sequencing reactions performed using the JDF-3 P410L/A485T 

i double mutant and a- 33 P Dideoxynucleotides. DNA sequencing ladders were generated using the 
; JDF-3 P410L/A485T double mutant at (A) 2|il (B) ljLtl (C) O.Sjil, the JDF-3 P410L mutant (clone 

p8, contains ancillary mutations and an amino-terminal tag)(D), or Thermo Sequenase (E). The 
3 top of the original sequencing gel is shown on the left side. The lanes are: (bottom) ddGTP, 
J ddATP, ddTTP, ddCTP (top). 

Figure 9 shows the result of ribonucleotide incorporation assays using exo JDF-3 (550) 
and mutants of this progenitor clone. The ratios of ribonucleotide versus deoxynucleotide 
incorporation are plotted for JDF-3 550, JDF-3 L408H, JDF-3 L408F and JDF-3 A485T. 

Figure 10 shows the traces of the sequence generated by four versions of JDF-3 DNA 
polymerase and FAM ddCTP. Panel A shows the minimal trace produced by the progenitor 
polymerase JDF-3 550, Panel B demonstrates the slightly improved trace made by JDF-3 P410L, 
Panel C shows the sequence generated by the double mutant S345P and P410L, and Panel D 
shows the trace created by JDF-3 S345P. 
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Figure 1 1 shows the difference in peak uniformity demonstrated by Thermo Sequenase in 
Panel A and the double mutant JDF-3 S345P + P410L in Panel B. 

Figure 12 shows the separated products of 3' extension of a labeled oligonucleotide with 
the dideoxynucleotide thymidine triphosphate of ROX-ddUTP (New England Nuclear (NEN) 
NEL476) or Fluorescein- 12-ddUTP (NEN NEL401). Mutant 4 is JDF-3 S345P, Mutant 2 is JDF- 
3 P410L, Mutant 3 is JDF-3 A485T and Mutant 5 is Y496N. F indicates FLU ddUTP and R 
indicates ROX ddUTP. 

Figure 13 shows a graphic representation of the relative band intensities form Figure 12. 
The numerical values are generated by dividing the intensity value of the ddTTP band into the 
intensity value for the Fluroescein-12-ddUTP bands. 

Figure 14 shows the sequence alignment of dye-dideoxynucleotide selected JDF-3 
mutants (amino acids 301-480). Nucleic acid residues highlighted by white boxes indication the 
location of a mutation. The mutation S345P is one of two mutations present in mutant 28. 

Figure 15 shows the sequence alignment of dye-dideoxynucleotide selected JDF-3 
(amino acids 481-660). Nucleic acid residues highlighted by white boxes indication the location 
of a mutation. 
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DESCRIPTION 

The invention is based on the discovery of Family B DNA polymerases that bear one 
or more genetic alterations resulting in reduced discrimination against non-conventional 
nucleotides relative to their unmodified wild-type forms. All references described herein are 
incorporated by reference herein in their entirety. 

Family B DNA Polymerase Exhibiting Reduced Discrimination Against Non-Conventional 
Nucleotides: 

A. DNA Polymerases Useful According to the Invention 

According to the invention, DNA polymerases of Family B may be mutated to 

generate enzymes exhibiting reduced discrimination against non-conventional nucleotides. 

Table I includes a non-limiting list of known DNA polymerases categorized by family. 

Table I. DNA POLYMERASES BY FAMILY 
FAMILY A DNA POLYMERASES 



Bacterial DNA Polymerases Reference 

a) E. coli DNA polymerase I (1) 

b) Streptococcus pneumoniae DNA polymerase I (2) 

c) Thermus aquaticus DNA polymerase I (3) 

d) Thermus flavus DNA polymerase I (4) 

e) Thermotoga maritima DNA polymerase I 

Bacteriophage DNA Polymerases 

a) T5 DNA polymerase (5) 

b) T7 DNA polymerase (6) 

c) Spol DNA polymerase (7) 

d) Spo2 DNA polymerase (8) 
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Mitochondrial DNA polymerase 

Yeast Mitochondrial DNA polymerase II 



(9, 10,11) 



FAMILY B DNA POLYMERASES 



Bacterial DNA polymerase 

E. coli DNA polymerase II (15) 

Bacteriophage DNA polymerase 

a) PRD1 DNA polymerase (16, 17) 

b) 929 DNA polymerase (18) 

c) M2 DNA polymerase (19) 

d) T4 DNA polymerase (20) 

Archaeal DNA polymerase 

a) Thermococcus litoralis DNA polymerase (Vent) (21) 

b) Pyrococcus furiosus DNA polymerase (22) 

c) Sulfolobus solfataricus DNA polymerase (23) 

d) Thermococcus gorgonarius DNA polymerase (64) 

e) Thermococcus species TY (65) 

f) Pyrococcus species strain KODI (66) 

g) Sulfolobus acidocaldarius (67) 

h) Thermococcus species 9°N-7 (68) 

i) Pyrodictium occultum (69) 
j) Methanococcus voltae (70) 
k) Desulfurococcus strain TOK (D. Tok Pol) (7 1 ) 

Eukaryotic Cell DNA polymerase 

(1) DNA polymerase alpha 

a) Human DNA polymerase (alpha) (24) 

b) S.cerevisiae DNA polymerase (alpha) (25) 
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c) S.pombe DNA polymerase I (alpha) (26) 

d) Drosophila melanogaster DNA polymerase (alpha) (27) 

e) Trypanosoma brucei DNA polymerase (alpha) (28) 

(2) DNA polymerase delta 

a) Human DNA polymerase (delta) (29 5 3 0) 

b) Bovine DNA polymerase (delta) (3 1) 

c) S. cerevisiae DNA polymerase III (delta) (32) 

d) S. pombe DNA polymerase III (delta) (33) 

e) Plasmodiun falciparum DNA polymerase (delta) (34) 

(3) DNA polymerase epsilon 

S. cerevisiae DNA polymerase II (epsilon) (35) 

(4) Other eukaryotic DNA polymerase 

S. cerevisiae DNA polymerase Rev3 (36) 

Viral DNA polymerases 

a) Herpes Simplex virus type 1 DNA polymerase (37) 

b) Equine herpes virus type 1 DNA polymerase (38) 

c) Varicella-Zoster virus DNA polymerase (39) 

d) Epstein-Barr virus DNA polymerase (40) 

e) Herpesvirus saimiri DNA polymerase (41) 

f) Human cytomegalovirus DNA polymerase (42) 

g) Murine cytomegalovirus DNA polymerase (43) 

h) Human herpes virus type 6 DNA polymerase (44) 

i) Channel Catfish virus DNA polymerase (45) 
j) Chlorella virus DNA polymerase (46) 
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k) Fowlpox virus DNA polymerase (47) 

1) Vaccinia virus DNA polymerase (48) 

m) Choristoneura biennis DNA polymerase (49) 
n) Autographa California nuclear polymerase virus (AcMNP V) 

DNA polymerase (50) 

0) Lymantria dispar nuclear polyhedrosis virus DNA polymerase (51) 
p) Adenovirus-2 DNA polymerase (52) 
q) Adenovirus-7 DNA polymerase (53) 
r) Adenovirus- 1 2 DNA polymerase (54) 

Eukaryotic linear DNA plasmid encoded DNA polymerases 

a) S-l Maize DNA polymerase (55) 

b) kalilo neurospora intermedia DNA polymerase (56) 

c) pA12 ascobolus immersus DNA polymerase (57) 

d) pCLKl Claviceps purpurea DNA polymerase (58) 

e) maranhar neurospora crassa DNA polymerase (59) 

f) pEM Agaricus bitorquis DNA polymerase (60) 

g) pGKLl Kluyveromyces lactis DNA polymerase (61) 

h) pGKL2 Kluyveromyces lactis DNA polymerase (62) 

1) pSKL Saccharomyces kluyveri DNA polymerase (63) 



B. Plasmids 

The starting sequences for the generation of Family B DNA polymerases according to 
the invention may be contained in a plasmid vector. A non-limiting list of cloned Family B 
DNA polymerases and their GenBank Accession numbers are listed in Table IL 
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Table II. Accession Information for Cloned Family B Polymerases 



Vent Thermococcus litoralis 
ACCESSION AAA72101 
PID g348689 

VERSION AAA72101.1 GL348689 
DBSOURCE locus THCVDPE accession M74198.1 

THEST THERMOCOCCUS SP. (STRAIN TY) 

ACCESSION 033845 

PID g3913524 

VERSION 033845 GL3913524 

DBSOURCE swissprot: locus DPOL_THEST, accession 033 845 

Pab Pyrococcus abyssi 
ACCESSION P77916 
PID g3913529 
VERSION P77916 GL3913529 

DBSOURCE swissprot: locus DPOL PYRAB, accession P7791 6 

PYRHO Pyrococcus horikoshii 
ACCESSION 059610 
PID g3913526 
VERSION 059610 GL3913526 

DBSOURCE swissprot: locus DPOLJPYRHO, accession 0596 1 0 

PYRSE PYROCOCCUS SP. (STRAIN GE23) 

ACCESSION P77932 

PID g3913530 

VERSION P77932 GL3913530 

DBSOURCE swissprot: locus DPOL_PYRSE, accession P77932 

DeepVent Pyrococcus sp. 
ACCESSION AAA67131 
PID g436495 

VERSION AAA67131.1 GI:436495 
DBSOURCE locus PSU00707 accession U00707. 1 

Pfu Pyrococcus furiosus 
ACCESSION P80061 
PID g399403 
VERSION P80061 GI:399403 

DBSOURCE swissprot: locus DPOL PYRFU, accession P8006 1 

JDF-3 Thermococcus sp. 

Unpublished 

Baross gi|2097756|pat|US|560201 1 1 12 Sequence 12 from patent US 560201 1 
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9degN THERMOCOCCUS SP. (STRAIN 90N-7). 

ACCESSION Q56366 

PID g39 13540 

VERSION Q56366 GE3913540 

DBSOURCE swissprot: locus DPOLJTHES9, accession Q56366 

KOD Pyrococcus sp. 
ACCESSION BAA06142 
PID gl620911 

VERSION BAA06142.1 GI: 1620911 

DBSOURCE locus PYWKODPOL accession D29671 . 1 

Tgo Thermococcus gorgonarius. 

ACCESSION 4699806 

PID g4699806 

VERSION GI:4699806 

DBSOURCE pdb: chain 65, release Feb 23, 1 999 

THEFM Thermococcus fiimicolans 

ACCESSION P74918 

PID g3913528 

VERSION P74918 GI:3913528 

DBSOURCE swissprot: locus DPOL THEFM, accession P7491 8 

METTH Methanobacterium thermoautotrophicum 

ACCESSION 027276 

PID g3913522 

VERSION 027276 GL3913522 

DBSOURCE swissprot: locus DPOLMETTH, accession 027276 

Metj a Methanococcus j annaschii 

ACCESSION Q58295 

PID g39 15679 

VERSION Q58295 GL3915679 

DB SOURCE swissprot: locus DPOL_METJA, accession Q5 8295 

POC Pyrodictium occultum 
ACCESSION B56277 
PID gl363344 
VERSION B56277 GL1363344 
DBSOURCE pir: locus B56277 

Apel Aeropyrum pernix 
ACCESSION BAA81109 
PID g5 105797 

VERSION BAA81 109.1 GL5105797 
DBSOURCE locus AP000063 accession AP000063 . 1 
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ARCFU Archaeoglobus fulgidus 
ACCESSION 029753 
PID g3122019 
VERSION 029753 GI:3 122019 

DBSOURCE swissprot: locus DPOL_ARCFU, accession 029753 

Desulfurococcus sp. Tok. 

ACCESSION 6435708 

PID g64357089 

VERSION GT:6435708 

DBSOURCE pdb. chain 65, release Jun 2, 1999 

Plasmids acceptable for the expression of modified forms of Family B DNA polymerases 
may be selected from a large number known in the art by one of skill in the art. A plasmid 
vector for expression of a modified DNA polymerase according to the invention will preferably 
comprise sequences directing high level expression of a DNA polymerase, and will more 
preferably comprise sequences directing inducible, high level expression of a DNA polymerase. 
As one example of an inducible high level expression system, plasmids placing a modified DNA 
polymerase coding sequence according to the invention under the control of a bacteriophage T7 
promoter may be introduced to bacteria containing an inducible T7 RNA polymerase gene within 
their chromosome. Induction of the T7 RNA polymerase gene subsequently induces high level 
expression of the T7 promoter-driven modified DNA polymerase gene (see for example, Gardner 
& Jack, Nucleic Acids Res. 27: 2545). 
C. Mutagenesis 

The cloned wild-type form of a Family B DNA polymerase may be mutated to generate 
forms exhibiting reduced discrimination against non-conventional nucleotides by a number of 
methods. 

First, methods of random mutagenesis which will result in a panel of mutants bearing one 
or more randomly-situated mutations exist in the art . Such a panel of mutants may then be 
screened for those exhibiting reduced discrimination relative to the wild-type polymerase (see 
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"Methods of Evaluating Mutants for Reduced Discrimination", below). An example of a method 
for random mutagenesis is the so-called "error-prone PCR method". As the name implies, the 
method amplifies a given sequence under conditions in which the DNA polymerase does not 
support high fidelity incorporation. The conditions encouraging error-prone incorporation for 
different DNA polymerases vary, however one skilled in the art may determine such conditions 
for a given enzyme. A key variable for many DNA polymerases in the fidelity of amplification 
is, for example, the type and concentration of divalent metal ion in the buffer. The use of 
manganese ion and/or variation of the magnesium or manganese ion concentration may therefore 
be applied to influence the error rate of the polymerase. 

Second, there are a number of site-directed mutagenesis methods known in the art which 
allow one to mutate a particular site or region in a straightforward manner. There are a number 
of kits available commercially for the performance of site-directed mutagenesis, including both 
conventional and PCR-based methods. Examples include the EXSITE™ PCR-Based Site- 
directed Mutagenesis Kit available from Stratagene (Catalog No. 200502; PCR based) and the 
QUIKCHANGE™ Site-directed mutagenesis Kit from Stratagene (Catalog No. 200518; non- 
PCR-based), and the CHAMELEON® double-stranded Site-directed mutagenesis kit, also from 
Stratagene (Catalog No. 200509). 

Older methods of site-directed mutagenesis known in the art relied upon sub-cloning of 
the sequence to be mutated into a vector, such as an Ml 3 bacteriophage vector, that allows the 
isolation of single-stranded DNA template. In these methods one annealed a mutagenic primer 
(i.e., a primer capable of annealing to the site to be mutated but bearing one or mismatched 
nucleotides at the site to be mutated) to the single-stranded template and then polymerized the 
complement of the template starting from the 3' end of the mutagenic primer. The resulting 
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duplexes were then transformed into host bacteria and plaques were screened for the desired 
mutation. 

More recently, site-directed mutagenesis has employed PCR methodologies, which have 
the advantage of not requiring a single-stranded template. In addition, methods have been 
developed that do not require sub-cloning. Several issues must be considered when PCR-based 
site-directed mutagenesis is performed. First, in these methods it is desirable to reduce the 
number of PCR cycles to prevent expansion of undesired mutations introduced by the 
polymerase. Second, a selection must be employed in order to reduce the number of non- 
mutated parental molecules persisting in the reaction. Third, an extended-length PCR method is 
preferred in order to allow the use of a single PCR primer set. And fourth, because of the non- 
template-dependent terminal extension activity of some thermostable polymerases it is often 
necessary to incorporate an end-polishing step into the procedure prior to blunt-end ligation of 
the PCR-generated mutant product. 

The protocol described below accomodates these considerations through the following 
steps. First, the template concentration used is approximately 1 000-fold higher than that used in 
conventional PCR reactions, allowing a reduction in the number of cycles from 25-30 down to 5- 
10 without dramatically reducing product yield. Second, the restriction endonuclease Dpnl 
(recognition target sequence: 5-Gm6ATC-3, where the A residue is methylated) is used to select 
against parental DNA, since most common strains of £ coli Dam methylate their DNA at the 
sequence 5-GATC-3. Third, Taq Extender is used in the PCR mix in order to increase the 
proportion of long (i.e., full plasmid length) PCR products. Finally, Pfu DNA polymerase is 
used to polish the ends of the PCR product prior to intramolecular ligation using T4 DNA ligase. 
The method is described in detail as follows: 

PCR-based Site Directed Mutagenesis of the 3'-5' Exonuclease domain 
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Plasmid template DNA (approximately 0.5 pmole) is added to a PCR cocktail containing: 
lx mutagenesis buffer (20 mM Tris HC1, pH 7.5; 8 mM MgC12; 40 ug/ml BSA); 12-20 pmole of 
each primer (one of skill in the art may design a mutagenic primer as necessary, giving 
consideration to those factors such as base composition, primer length and intended buffer salt 
concentrations that affect the annealing characteristics of oligonucleotide primers; one primer 
must contain the desired mutation, and one (the same or the other) must contain a 5' phosphate to 
facilitate later ligation), 250 uM each dNTP, 2.5 U Taq DNA polymerase, and 2.5 U of Taq 
Extender (Available from Stratagene; See Nielson et al. (1994) Strategies 7: 27, and U.S. Patent 
No. 5,556,772). The PCR cycling is performed as follows: 1 cycle of 4 min at 94°C, 2 min at 
50°C and 2 min at 72°C; followed by 5-10 cycles of 1 min at 94°C, 2 min at 54°C and 1 min at 
72°C. The parental template DNA and the linear, PCR-generated DNA incorporating the 
mutagenic primer are treated with Dpnl (10 U) and Pfu DNA polymerase (2.5U). This results in 
the Dpnl digestion of the in vivo methylated parental template and hybrid DNA and the removal, 
by Pfu DNA polymerase, of the non-template-directed Taq DNA polymerase-extended base(s) 
on the linear PCR product. The reaction is incubated at 37°C for 30 min and then transferred to 
72°C for an additional 30 min. Mutagenesis buffer (1 15 ul of lx) containing 0.5 mM ATP is 
added to the DpnI-digested, Pfu DNA polymerase-polished PCR products. The solution is 
mixed and 10 ul are removed to a new microfuge tube and T4 DNA ligase (2-4 U) is added. The 
ligation is incubated for greater than 60 min at 37°C. Finally, the treated solution is transformed 
into competent E. coli according to standard methods. 
D. Non-Conventional Nucleotides Useful According to the Invention. 

There is a wide variety of non-conventional nucleotides available in the art. Any or all of 
them are contemplated for use with a DNA polymerase of the invention. A non-limiting list of 
such non-conventional nucleotides is presented in Table III. 
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Table III. Non-Conventional Nucleotides 
DIDEOXYNUCLEOTIDE ANALOGS 
Fluorescein Labeled Fluorophore Labeled 



Fluorescein- 1 2-ddCTP 
Fluorescein-12-ddUTP 
Fluorescein- 1 2-ddATP 
Fluorescein-12-ddGTP 
Fluorescein-N6-ddATP 



Eosin-6-ddCTP 
Coumarin-5-ddUTP 
Tetramethylrhodamine-6-ddUTP 
Texas Red-5-ddATP 
LISSAMINE™-rhodamine-5-ddGTP 



FAM Labeled 

FAM-ddUTP 
FAM-ddCTP 
FAM-ddATP 
FAM-ddGTP 



TAMRA Labeled 



TAMRA-ddUTP 



TAMRA-ddCTP 



TAMRA-ddATP 



TAMRA-ddGTP 



ROX Labeled 

ROX-ddUTP 
ROX-ddCTP 
ROX-ddATP 
ROX-ddGTP 



JOE Labeled 

JOE-ddUTP 
JOE-ddCTP 
JOE-ddATP 
JOE-ddGTP 



R6G Labeled 

R6G-ddUTP 
R6G-ddCTP 
R6G-ddATP 
R6G-ddGTP 



R110 Labeled 

RllO-ddUTP 
RllO-ddCTP 
RllO-ddATP 
RllO-ddGTP 
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BIOTIN Labeled 

Biotin-N6-ATP 

DEOXYNUCLE( 

TTP Analogs 

Fluorescein-12-dUTP 

Coumarin-5-dUTP 

Tetramethylrhodamine-6-dUTP 

Tetraethylrhodamine-6-dUTP 

Texas Red-5-dUTP 

LISSAMINE™-rhodamine-5-dUTP 

Naphthofluorescein-5-dUTP 

Fluorescein Chlorotriazinyl-4-dUTP 

Pyrene-8-dUTP 

Diethylaminocoumarin-5-dUTP 

dCTP Analogs 
Coumarin-5 -dCTP 
Fluorescein- 12-dCTP 
Tetramethylrhodamine-6-dCTP 
Texas Red-5-dCTP 
LISSAMINE™-rhodamine-5-dCTP 
Naphthofluorescein-5 -dCTP 
Fluorescein Chlorotriazinyl-4-dCTP 



DNP Labeled 

DNP-N6-ddATP 
ANALOGS 
dATP-Analogs 
Coumarin-5-dATP 
Diethylaminocoumarin-5 -d ATP 
Fluorescein-12-dATP 
Fluorescein Chlorotriazinyl-4-dATP 
LISSAMINE™-rhodamine-5-dATP 
Naphthofluorescein-5 -dATP 
Pyrene-8-dATP 

Tetramethylrhodamine-6-dATP 
Texas Red-5-dATP 
DNA-N6-dATP 

Biotin-N6-dATP 

dGTP Analogs 

Coumarin-5 -dGTP 

Fluorescein-12-dGTP 

Tetratnethylrhodamine-6-dGTP 

Texas Red-5-dGTP 

LISSAMINE™-rhodamine-5-dGTP 



Pyrene-8-dCTP 
Fluorescein-N4-dCTP 
Biotin-N4-dCTP 
DNP-N4-dCTP 



Diethylarainocoumarin-5-dCTP 



CTP Analogs 
Coumarin-5-CTP 
Fluorescein- 1 2-CTP 
Tetrainethylrhodainine-6-CTP 
Texas Red-5-CTP 
LISSAMINE™-rhodamine-5-CTP 

Naphthofluorescein-5 -CTP 

Fluorescein Chlorotriazinyl-4-CTP 

Pyrene-8-CTP 

Fluorescein-N4-CTP 

Biotin-N4-CTP 

ATP Analogs 

Coumarin-5-ATP 

Fluorescein- 1 2 - ATP 

Tetramethylrhodamine-6-ATP 

Texas Red-5-ATP 

LISSAMINE™-rhodamine-5-ATP 

Fluorescein-N6-ATP 



RIBONUCLEOTIDE ANALOGS 
UTP Analogs 



Fluorescein- 1 2-UTP 
Coumarin-5 -UTP 
Tetramethylrhodamine-6-UTP 
Texas Red-5-UTP 
LISSAMINE™-5-UTP 

Naphthofluorescein-5-UTP 
Fluorescein Chlorotriazinyl-4-UTP 
Pyrene-8-UTP 
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Biotin-N6-ATP 
DNP-N6-ATP 

Additional non-conventional nucleotides useful according to the invention include, but 
are not limited to 7-deaza-dATP, 7-deaza-dGTP, 5'-methyl-2'-deoxycytidine-5Mriphosphate. 
Further non-conventional nucleotides or variations on those listed above are discussed by Wright 
& Brown, 1990, Pharmacol. Ther. 47: 447. It is specifically noted that ribonucleotides qualify as 
non-conventional nucleotides, since ribonucleotides are not generally incorporated by DNA 
polymerases. Modifications of Family B DNA polymerases that result in the ability, or 
enhanced ability, of the polymerase to incorporate labeled or unlabeled ribonucleotides are 
specifically contemplated herein. 

E. Methods of Evaluating Mutants for Reduced Discrimination 

Random or site-directed mutants generated as known in the art or as described herein 
and expressed in bacteria may be screened for reduced discrimination against non-conventional 
nucleotides by several different assays. In one method, Family B DNA polymerase proteins 
expressed in lytic lambda phage plaques generated by infection of host bacteria with expression 
vectors based on, for example, Lambda ZapII®, are transferred to a membrane support. The 
immobilized proteins are then assayed for polymerase activity on the membrane by immersing 
the membranes in a buffer containing a DNA template and the unconventional nucleotides to be 
monitored for incorporation. 

Mutant polymerase libraries may be screened using a variation of the technique used 
by Sagner et al (Sagner, G., Ruger, R., and Kessler, C. (1991) Gene 97:1 19-123). For this 
approach, lambda phage clones are plated at a density of 10-20 plaques per square centimeter. 
Proteins present in the plaques are transferred to filters and moistened with polymerase screening 
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buffer (50mM Tris (pH 8.0), 7mM MgCl 2 , 3mM p-ME). The filters are kept between layers of 
plastic wrap and glass while the host cell proteins are heat-inactivated by incubation at 65°C for 
30 minutes. The heat-treated filters are then transferred to fresh plastic wrap and approximately 
35|Lil of polymerase assay cocktail are added for every square centimeter of filter. The assay 
cocktail consists of IX cloned Pfu (cPfu) magnesium free buffer (IX buffer is 20 mM Tris-HCl 
(pH 8.8), 10 mM KC1, 10 mM (NH4) 2 S0 4? 100 ug/ml bovine serum albumin (BSA), and 0.1% 
Triton X-100; Pfu Magnesium-free buffer may be obtained from Stratagene (Catalog No. 
200534)), 125 ng/ml activated calf thymus or salmon sperm DNA, 1.29 jiCi/ml a- 33 P ddNTP or 
dideoxynucleotides (at a dNTP:dye-ddNTP ratio of 1 : 1 5). Initial screening was done in the 
presence of MnCfe, but the preferred method was to screen in IX Taq Polymerase buffer (1.5 
mM MgCl 2 ) The filters are placed between plastic wrap and a glass plate and then incubated at 
65°C for one hour, and then at 70°C for one hour and fifteen minutes. Filters are then washed 
three times in 2X SSC for five minutes per wash before rinsing twice in 100% ethanol and 
vacuum drying. Filters are then exposed to X-ray film (approximately 16 hours), and plaques 
that incorporate label are identified by aligning the filters with the original plate bearing the 
phage clones. Plaques identified in this way are re-plated at more dilute concentrations and 
assayed under similar conditions to allow the isolation of purified plaques. 

In assays such as the one described above, the signal generated by the label is a direct 
measure of the activity of the polymerase with regard to that particular unconventional 
nucleotide or combination of unconventional nucleotides used in the assay. Unconventional 
nucleotides corresponding to all four conventional nucleotides may be included in the reactions, 
or, alternatively, only one unconventional nucleotide may be included to assess the effect of the 
mutation(s) on utilization of a given unconventional nucleotide. One approach is to use 
unconventional nucleotides corresponding to all four nucleotides in a first screen to identify 
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clones that incorporate more than a reference wild-type clone, and then to monitor the 
incorporation of individual unconventional nucleotides in a subsequent screen. In the preferred 
screening mode, only the dideoxynucleotides and dideoxynucleotide analogs of ddATP, ddCTP, 
and ddTTP would be used since ddGTP is not discriminated against by some DNA polymerases 
and increases the background signal of any screen 

In order to screen for clones with enhanced ability to incorporate dideoxynucleotides, 
clones identified in first screens utilizing only dideoxynucleotides may then be characterized by 
their sensitivity to low levels of each of the four dideoxynucleotides in a DNA polymerase 
nucleotide incorporation assay employing all four dNTPs, a ;H-TTP tracer, and a low level of 
each ddNTP. Since incorporation of dideoxynucleotides stops DNA chain elongation, superior 
ability to incorporate dideoxynucleotides diminishes the incorporation of tritium labeled 
deoxynucleotides relative to wild-type DNA polymerase. Comparisons of ddNTP concentrations 
that bring about 50% inhibition of nucleotide incorporation (ho%) can be used to compare ddNTP 
incorporation efficiency of different polymerases or polymerase mutants. Comparisons of Iso% 
values for ddATP, ddCTP, ddGTP, and ddTTP can be used to identify mutants with reduced 
selectivity for particular bases. Such mutants would be expected to produce more uniform DNA 
sequencing ladders. 

In order to measure incorporation of individual ddNTPs, cocktails are prepared which 
consist of varying concentrations of the ddNTP of interest, and a total of 200jiM of each 
nucleotide triphosphate. For example, the incorporation of ddATP by wild type JDF-3 
polymerase may be measured at 0, 40, 80, 120 and 160jiM ddATP. In these reactions, dATP 
concentrations would be adjusted to 200, 160, 120, 80, and 40p,M, respectively, so that the total 
amount of adenine nucleotide triphosphate is 200juM. In comparison, mutants may be assayed 
using ddATP concentrations of 0, 5, 10, and 20)iiM ddATP, and adjusted dATP concentrations of 
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200, 195, 1 90, and 180|iM, respectively (dATP + ddATP = 200jiM). Additional cocktails are 
prepared to similarly measure ddCTP, ddGTP, and ddTTP incorporation. 

Incorporation of nucleotides under the concentration parameters described above may 
be measured in extension reactions by adding, for example, 1 jxl of appropriately diluted bacterial 
extract (i.e., heat-treated and clarified extract of bacterial cells(see Example 1, part M) 
expressing a cloned polymerase or mutated cloned polymerase) to 10 ul of each nucleotide 
cocktail, followed by incubation at 72°C for 30 minutes. Extension reactions are quenched on 
ice, and then 5|ul aliquots are spotted immediately onto DE81 ion-exchange filters (2.3cm; 
Whatman #3658323). Unincorporated label is removed by 6 washes with 2xSCC (0.3M NaCl, 
30mM sodium citrate, pH 7.0), followed by a brief wash with 100% ethanol. Incorporated 
radioactivity is then measured by scintillation counting. Reactions that lack enzyme are also set 
up along with sample incubations to determine "total cpms" (omit filter wash steps) and 
"minimum cpms"(wash filters as above). 

Cpms bound is proportional to the amount of polymerase activity present per volume 
of bacterial extract. The volume of bacterial extract (generally about 0.25-ljil) which brings 
about incorporation of approximately 10,000 cpms is determined for use in subsequent 
nucleotide analog incorporation testing. 

Genes for mutant DNA polymerases generated by random mutagenesis may be 
sequenced to identify the sites and number of mutations. For those mutants comprising more 
than one mutation, the effect of a given mutation may be evaluated by introduction of the 
identified mutation to the exo" progenitor gene by site-directed mutagenesis in isolation from the 
other mutations borne by the particular mutant. Screening assays of the single mutant thus 
produced will then allow the determination of the effect of that mutation alone. 
F. Expression of Mutated Family B DNA Polymerase According to the Invention 
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Methods known in the art may be applied to express and isolate the mutated forms of 
Family B DNA polymerase according to the invention. Many bacterial expression vectors 
contain sequence elements or combinations of sequence elements allowing high level inducible 
expression of the protein encoded by a foreign sequence. For example, as mentioned above, 
bacteria expressing an integrated inducible form of the T7 RNA polymerase gene may be 
transformed with an expression vector bearing a mutated DNA polymerase gene linked to the T7 
promoter. Induction of the T7 RNA polymerase by addition of an appropriate inducer, for 
example, isopropyl-P-D-thiogalactopyranoside (IPTG) for a lac-inducible promoter, induces the 
high level expression of the mutated gene from the T7 promoter (see Gardner & Jack, 1999, 
supra). 

Appropriate host strains of bacteria may be selected from those available in the art by 
one of skill in the art. As a non-limiting example, E. coli strain BL-21 is commonly used for 
expression of exogenous proteins since it is protease deficient relative to other strains of E. coll 
BL-21 strains bearing an inducible T7 RNA polymerase gene include WJ56 and ER2566 
(Gardner & Jack, 1999, supra). For situations in which codon usage for the particular 
polymerase gene differs from that normally seen in E. coli genes, there are strains of BL-21 that 
are modified to carry tRNA genes encoding tRNAs with rarer anticodons (for example, argU, 
ileY, leuW, and proL tRNA genes), allowing high efficiency expression of cloned protein genes, 
for example, cloned archaeal enzyme genes (several BL21 -CODON PLUS™ cell strains 
carrying rare-codon tRNAs are available from Stratagene, for example). 

There are many methods known to those of skill in the art that are suitable for the 
purification of a modified DNA polymerase of the invention. For example, the method of 
Lawyer et al. (1993, PCR Meth. & App. 2: 275) is well suited for the isolation of thermostable 
DNA polymerases expressed in E. coli, as it was designed originally for the isolation of Taq 
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polymerase. Alternatively, the method of Kong et al (1993, J. Biol. Chem. 268: 1965, 
incorporated herein by reference) may be used, which employs a heat denaturation step to 
destroy host proteins, and two column purification steps (over DEAE-Sepharose and heparin- 
Sepharose columns) to isolate highly active and approximately 80% pure thermostable DNA 
polymerase. Further, as detailed in Example 1 , part N, below, DNA polymerase mutants may be 
isolated by an ammonium sulfate fractionation, followed by Q Sepharose and DNA cellulose 
columns, or by adsorption of contaminants on a HiTrap Q column, followed by gradient elution 
from a HiTrap heparin column. 

G. Preparation of Thermococcus species JDF-3 Thermostable DNA Polymerase With Reduced 
Discrimination 

To prepare thermostable Family B polymerases which exhibit reduced discrimination 
for dideoxynucleotide triphosphates (ddNTPs), the DNA sequence encoding a 3' to 5' 
exonuclease- deficient (D141 A) Family B polymerase from the hyperthermophilic archaeon 
Thermococcus species JDF-3 was subjected to random mutagenesis using "error-prone PCR" as 
described herein, and cloned into the bacteriophage lambda Zap® II. The polymerase from JDF- 
3 was chosen due to superior processivity, polymerization rate and ddNTP incorporation relative 
to the Family B DNA polymerase from Pyro coccus furiosus (Pfu) (see Table IV, below). The 
library of mutants was plated on E. coli hosts and the proteins present in the lytic plaques were 
transferred to a solid support that was then immersed in a buffer containing DNA template and 
all four a- P labeled dideoxynucleotides. Mutants that incorporated the labeled 
dideoxynucleotide produced signals that corresponded to their ability to incorporate the a- P 
ddNTPs. Isolated clones were then characterized by their sensitivity to low levels of each of the 
four dideoxynucleotides in a DNA polymerase nucleotide incorporation assay employing all four 
dNTPs and a ;H-TTP tracer. Since incorporation of dideoxynucleotides stops DNA chain 
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elongation, superior ability to incorporate dideoxynucleotides diminishes the incorporation of 
tritium labeled deoxynucleotides. The unmutated progenitor DNA polymerase rarely 
incorporates dideoxynucleotides and is only 50% inhibited at high ddNTP levels (100-160 
micromolar each ddNTP). The mutant enzymes show 50% inhibition at 5 to 40 micromolar 
concentrations of ddNTP and improved incorporation was observed for all four ddNTPs (ddATP ? 
ddCTP, ddTTP and ddGTP; see Tables V and VI in Example 1, below). 

The incorporation of non-conventional nucleotides was also evaluated through use of 
purified mutant polymerases in cycle sequencing, with a- P labeled ddNTPs present at 
0.021 jiM and dNTPs present at 2.1|uM each. The mutants readily utilized all four 
rf dideoxynucleotides and produced sequencing ladders that compared favorably to Thermo 
% Sequenase®, which uses an F667Y Taq DNA polymerase mutant (VanderHorn et al., 1997, 
S BioTechniques 22: 758). 

The domains of relevance in 17 of the 40 purified mutants were sequenced. Most 

3 randomly mutated clones contained more than one mutation in the regions sequenced but all 

4 mutants contained mutations at one of three sites. Mutations predicted to confer an enhanced 
^ ddNTP uptake phenotype were introduced into the progenitor exonuclease deficient DNA 

polymerase sequence by site-directed mutagenesis to eliminate ancillary mutations which were 
not expected to contribute to the improved dideoxynucleotide uptake phenotype. 

Sixteen of the seventeen JDF-3 DNA polymerase mutations were found in region II 
(motif A) on either side of the tyrosine in the consensus sequence 404 DxxSLYPSII 413. These 
mutations consisted of DFRSLYLSII (P410L), DFRSHYPSII (L408H) and DFRSFYPSII 
(L408F). Therefore, the LYP motif of region II appears to be important in ddNTP discrimination 
in the JDF-3 Family B polymerase. 
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The prior art modification of the tyrosine corresponding to Y409 in JDF3 Family B 
DNA polymerase is recognized for its positioning in the nucleotide binding pocket. As shown 
herein, however, modification of the residues neighboring Y409 (L408H or L408F or P410L) 
had the unexpected effect of profoundly altering nucleotide binding, particularly with respect to 
ddNTP incorporation. 

The only JDF-3 DNA polymerase mutation leading to enhanced incorporation of non- 
conventional nucleotides occurring outside of region II is an alanine (ala or A) to threonine (thr 
or T) conversion at position 485 in region III (A485T), This site is two residues upstream of 
KX 3 NSXYG (Jung et al., 1990, supra; Blasco et al, 1992, supra; Dong et al., 1993, J. Biol. 
Chem. 268:21 163; Zhu et al, 1994, Biochem. Biophys. Acta 1219:260; Dong and Wang, 1995, 
J. Biol. Chem. 270:21563) (referred to as region III or motif B) which is functionally, but not 
structurally (Wang et al., 1997, supra), analogous to KX 3 (F/Y)GX 2 YG in helix O of the Family 
A DNA polymerases. In Family A DNA polymerases, such as the Klenow fragment and Taq 
DNA polymerases, the O helix contains amino acids that play a major role in dNTP binding 
(Astatke et al., 1998, J. Mol. Biol. 278:147; Astatke et al, 1995, J. Biol. Chem. 270:1945; 
Polesky et al, 1992, J. Biol. Chem 267:8417; Polesky et al, 1990, J. Biol. Chem. 265:14579; 
Pandey et ah, 1994, J. Biol. Chem. 269:13259; Kaushik et al, 1996, Biochem. 35:7256). 
Specifically, helix O contains the F (F762 in the Klenow fragment; F667 in Taq) which confers 
ddNTP discrimination in Family A DNA polymerases (KX 3 (F/Y)GX 2 YG) (Tabor and 
Richardson, 1995, supra). 

The effect of the A485T mutation on ddNTP incorporation in the JDF-3 DNA 

polymerase is surprising since the RB69 and Thermococcus gorgonarius crystal structures 

(Hopfher et al., 1999, supra) show it facing away from the proposed active site of the nucleotide 

binding surface. Moreover, the type of side chain conferring ribose selectivity in archaeal 
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Family B DNA polymerases (A: small, non-polar) is different from that of the bulky, aromatic Y 
and F residues that dictate ddNTP discrimination in Family A DNA polymerases (Tabor and 
Richardson, 1995, supra). Additionally, this position (A485) is not well conserved among either 
DNA polymerase family and is not included in the consensus sequence for this domain 
(Braithwaite and Ito, 1993, supra), implying a lack of critical importance in dNTP recognition. 

A JDF-3 double mutant was constructed that contains mutations P410L and A485T. 
In dideoxynucleotide cycle sequencing, the banding pattern intensity demonstrated by the double 
mutant was extremely uniform, suggesting little if any preference for any dNTP over its 
corresponding ddNTP (See Figure 8 and Example 1Q). This polymerase characteristic improves 
i the accuracy of base calling in automated sequencing. We presume that combinations of P410L 
£ and A485 mutations, L408H and A485 mutations, and L408F and A485 mutations would result 
in enzymes that exhibit improved ddNTP incorporation. The efficiency of dideoxynucleotide 
incorporation by such double mutant enzymes may also be characterized or quantitated by 

3 measurement of the I 5 o% as described herein to determine the relative degree of improvement in 

4 incorporation. 

EXAMPLES 

The following examples are offered by way of illustration only and are by no means 
intended to limit the scope of the claimed invention. 
EXAMPLE 1. 

A. Cloning a DNA polymerase gene from Thermococcus species JDF-3 DNA polymerase. 

A Thermococcus species was cultured from submarine samples taken from the Juan de 
Fuca ridge. Genomic DNA was isolated and used to prepare a genomic DNA library in ZAP II 
(Stratagene) using standard procedures. The lambda library was plated on XL 1 -Blue MRF ? E. 
coli and screened for clones with DNA polymerase activity using a variation of the method 
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described by Sagner et al (Sagner, G. ? Ruger, R., and Kessler, C. (1991) Gene 97:1 19-123). 
Plaques containing active polymerase were cored and stored in SM buffer. Positive primary 
plaques were re-plated and re-assayed to allow purification of isolated clones. Secondary clones 
were excised according to the instructions provided with the ZAP II system (Stratagene), and the 
DNA sequence of the insert determined (Figure 1). 

The translated amino acid sequence of the JDF-3 DNA polymerase is shown in Figure 
2. Amino acid sequence alignments show that JDF-3 DNA polymerase exhibits homology to the 
class of DNA polymerases referred to as Family B. 

Recombinant JDF-3 DNA polymerase was purified as described below (see 
"Purification of JDF-3" (method 1)). The biochemical properties of JDF-3 DNA polymerase 
have been compared to those of other commercially available archaeal DNA polymerases. The 
results shown in Table IV and V indicated that, compared to other enzymes, JDF-3 exhibits 
higher processivity, a faster polymerization rate (K cat ), and a greater tendency to utilize ddNTPs. 
JDF-3 DNA polymerase was therefore chosen for development of a DNA sequencing enzyme. 

Table IV. Polymerase Activities of Archaeal Family B DNA Polymerases 

Polymerase Specific Activity 
(U/mg)xl0 4 

Activated DNA Primed Ml 3 DNA dNTP 

(nM) (jwM each) 

~ 2.6 ±.07 4.1 ±.07 2.0 ± .02 07 16 + 2 

exo'P/w 23 0.5 12 

JDF-3 1.2 ±.07 5.2 2.0 16 ±2 

Vent 1.8 a 0.7 a 0.1 a 57 a 

a H. Kong, R.B. Kucera, and W.E Jack, J. Biol. Chem . 268. 1965 (1993). 

B. Intern removal from the gene encoding JDF-3 DNA polymerase. 

By alignment to Family B DNA polymerase sequences, the JDF-3 DNA polymerase 

clone was found to contain an intein sequence (Figures 3 and 4). To improve expression of 
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recombinant JDF-3 polymerase, the intein was removed by inverse PGR. PGR primers were 

designed to prime immediately upstream and downstream to the sequence coding for the intein 

termini, and were oriented such that the 3' ends of the primers were pointed away from the 

intein. The primers were also modified with 5' -phosphate groups to facilitate ligation. The 

plasmid/insert sequence was PCR amplified and circularized by standard methods. 

C. Construction of a JDF-3 DNA polymerase mutant with diminished 3'-5' exonuclease 

activity. 

DNA polymerases lacking 3 '-5' exonuclease (proofreading) activity are preferred for 
applications requiring nucleotide analog incorporation (e.g., DNA sequencing) to prevent 
removal of nucleotide analogs after incorporation. The 3 '-5' exonuclease activity associated with 
proofreading DNA polymerases can be reduced or abolished by mutagenesis. Sequence 
comparisons have identified three conserved motifs (exo I, II, III) in the 3 '-5' exonuclease 
domain of DNA polymerases (reviewed V. Derbyshire, J.K. Pinsonneault, and CM. Joyce, 
Methods Enzvmol . 262 , 363 (1995)). Replacement of any of the conserved aspartic or glutamic 
acid residues with alanine has been shown to abolish the exonuclease activity of numerous DNA 
polymerases, including archaeal DNA polymerases such as Vent (H. Kong, R.B. Kucera, and 
W.E. Jack, J. Biol. Chem . 268 , 1965 (1993)) and Pfu (Stratagene, unpublished). Conservative 
substitutions lead to reduced exonuclease activity, as shown for mutants of the archaeal 9° N-7 
DNA polymerase (M.W. Southworth, H. Kong, R.B. Kucera, J. Ware, H. Jannasch, and F.B. 
Perler, Proc. Natl. Acad. Sci . 93, 5281 (1996)). 

JDF-3 DNA polymerase mutants exhibiting substantially reduced 3 '-5' exonuclease 
activity were prepared by introducing amino acid substitutions at the conserved 1 4 ID or 143 E 
residues in the exo I domain. Using the CHAMELEON® Double-Stranded, Site-Directed 
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Mutagenesis Kit (Stratagene), the following JDF-3 mutants were constructed: D141 A, D141N, 
D141S, D141T, D141E and E143A. 

To analyze JDF-3 mutant proteins, the DNA sequence encoding JDF-3 DNA polymerase 
was PCR amplified using primers GGG AAA CAT ATG ATC CTT GAC GTT GAT TAC 
(where Ndel site in bold and start codon underlined) and GGG AAA GGA TCC TCA CTT CTT 
CTT CCC CTT C (where BamHI site shown in bold type). The PCR products were digested, 
purified, and ligated into a high expression level vector using standard methods. Plasmid clones 
were transformed into BL21(DE3). Recombinant bacterial clones were grown using standard 
procedures and JDF-3 polymerase mutants were expressed in the absence of induction. The 
exonuclease and polymerase activities of recombinant clones were assayed using bacterial 
lysates. Typically, crude extracts were heated at 70°C for 15-30 minutes and then centrifuged to 
obtain a cleared lysate. 

There are several methods of measuring 3 ? to 5 J exonuclease activity known in the art, 
including that of Kong et al. (Kong et al, 1993, J. Biol. Chem. 268: 1965) and that of 
Southworth et al. (Southworth et al., 1996, Proc. Natl. Acad. Sci. U.S.A. 93: 5281), the full 
contents of both of which are hereby incorporated by reference. The exonuclease activity of wild 
type and active mutant polymerases as measured by the Kong et al. method were as follows: 

Exo activity (U/mg): 

Wt 915 

D141A7 

D141N953 

D141S954 

D141T0.5 
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D141E940 



E143A0.3 



The combination exonuclease mutant D141 A+E143A was made as described in section 

L. 

The E143A JDF-3 mutant (clone #550) exhibited significantly reduced 3'-5' exo activity 
and was chosen for further mutagenesis to improve incorporation of ddNTP and other nucleotide 
analogs. Other JDF-3 mutants with substantially reduced exonuclease activity could have been 
used for this purpose, such as the JDF-3 D141T mutant, for experiment or applications requiring 

3 the absolute elimination of 3' to 5' exonuclease activity, the double mutant D 141 A + El 43 A was 
11 preferred. 

^ Error-prone PCR amplification of the JDF-3 DNA polymerase gene. 

Random mutations were introduced into exo" JDF-3 by amplifying the entire gene (clone 

#550) under conditions which did not support high fidelity replication. To broaden the spectrum 
of potential mutations, three different PCR enzymes were used under error-prone conditions. 

4 In the preferred mode, ten reactions of 1 OOjil each were amplified with each PCR 
3 enzyme. 

i. Amplifiction with Taq DNA polymerase: 
Reaction Mixture 

lx magnesium free Taq Buffer (Stratagene catalog #200530) 



ImM 



each TTP and dCTP 



0.2 mM 



each dGTP and dATP 



2ng/fil 



Primer 923 (also called 490) 



2ng/|xl Primer 721 



0.05u/[il 



Taq2000 (Stratagene catalog #600195) 



-58- 



1.5mMMgCl 2 

0.5mM MnCl 2 

O.lpM plasmid DNA (clone #550) 

Cycling Parameters 

PCRs were carried out using Stratagene's ROBOCYCLER™40 Temperature Cycler with 
a Hot Top assembly. The following cycling conditions were used: 

1) 95°C for 1 minute 

2) 95°C for 1 minute 

3) 54°C for 1 minute 

4) 72°C for 2.5 minutes 

5) Repeat steps 2 to 4 thirty times. 

ii. Amplification with exo" JDF-3 DNA polymerase 
Reaction Mixture 

lx magnesium free Taq Buffer (Stratagene catalog #200530) 

450^Meach deoxynucleotide (dGTP, dATP, TTP and dCTP) 
2ng/jil Primer 923 (also called 490) 
2ng/|Lil Primer 721 

0. Iu/jllI exo" JDF-3 DNA polymerase 

0.5mMMnCl 2 

0. lpM plasmid DNA (clone #550) 

Cycling Parameters 

PCRs were carried out using Stratagene's ROBOCYCLER™40 Temperature Cycler with 
a Hot Top assembly. The following cycling conditions were used: 
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1) 95°C for 1 minute 

2) 95°C for 1 minute 

3) 54°C for 1 minute 

4) 72°C for 2.5 minutes 

5) Repeat steps 2 through 4 thirty times, 
iii. Amplifiction with exo" Pfu DNA polymerase 
Reaction Mixture 

lx TAQPLUS® Precision Buffer (Stratagene catalog #600210) 

200uMeach deoxynucleotide (dGTP, dATP, TTP, dCTP) 
2ng/ul Primer 923 (also called 490) 

2ng/fil Primer 721 

0.05u/ul exo" Pfu DNA polymerase (Stratagene catalog number 600163) 
0. lpM plasmid DNA (clone #550) 

Cycling Parameters 

PCRs were carried out using Perkin-Elmer's 9600 Temperature Cycler. The following 
cycling conditions were used: 

1) 95°C for 1 minute 

2) 95°C for 1 minute 

3) 53°C for 1 minute 

4) 72°C for 5 minutes 

5) Repeat steps 2 through 4 thirty times. 
Forward Primers 
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Earlier versions of the mutant libraries were made with the forward primer 461, which 
contains an EcoR I site. When products amplified with primers 461 and 923 were restriction 
digested and cloned into the lambda vector as described in the following section, JDF-3 DNA 
polymerase was synthesized as a fusion protein with the first 39 amino acids of the vector- 
encoded (3-galactosidase (lacZ) protein. 

Primer 461 5 5 TCAGATGAATTCGAT GATCCTTGACGTTGATTAC 3' 

EcoR I JDF-3 specific sequence 

The clones isolated using primer 461 were designed as p#. 

The preferred mode of amplification and cloning utilizes the forward primer 721, which 
also contains an EcoR I site followed by three consecutive in-frame stop codons and a ribosome 
binding site. This arrangement allows the JDF-3 DNA polymerase to be translated without any 
vector-derived residues at the amino terminus. The clones isolated from libraries constructed 
with the forward primer 721 were designated as l-# to differentiate them from the p# series of 
clones. 
Primer 721 

5' GAGAGAATTCA TAATGATAA GGAGGAAAAAAT TATGATCCTTGACGTTGATTAC 3' 

EcoR I 3x STOP JDF-3 specific sequence 

Reverse Primers 

Primer 923 (490) 5' TCAGATCTCGAG TCACTTCTTCTTCCCCTTC 3' 

Xho I JDF-3 specific sequence 
E. Preparing PCR products for cloning. 

PCR products were purified and concentrated with the STRATAPREP™ PCR 
Purification kit (Stratagene catalog number 400771). The PCR products were then digested with 
50 units of Xho I and 50 units of EcoR I in 1.5x Universal buffer (lOx Universal Buffer: 1M 
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KOAc, 250mM Tris-Acetate (pH 7.6), lOOmM MgOAc, 5mM p-mercaptoethanol and 100 
jug/ml BSA) for one hour at 37°C. The digested samples were run on a 1% agarose, lx TBE gel 
and visualized with ethidium bromide staining. The 2,3kb amplification product was gel isolated 
and purified with the STRATAPREP™ DNA Gel Extraction Kit (Stratagene catalog number 
400766). 

F. Cloning PCR Inserts into the Uni-Zap®XR Lambda Vector. 

200ng of purified amplification product was ligated with ljig of UNI-ZAP®XR Lambda 
Vector (Stratagene catalog #239213), which had been predigested with EcoR I and Xho I and 
then dephosphorylated with alkaline phosphatase (Stratagene catalog number 23721 1). The 
DNAs were ligated using 2 units of T4 DNA ligase (Stratagene catalog number 60001 1) and 
0.5mM ATP in lx ligase buffer (50mM Tris-HCL (pH 7.5), 7mM MgCl 2 , ImM DTT) in reaction 
volumes of 10 to 15jaL Ligations were carried out at 16°C for a minimum of 16 hours. 

G. Lambda Packaging and Bacterial Infection. 

Two microliters of each ligation reaction were packaged with GIGAPACK® III Gold 

Packaging extract (Stratagene catalog #200201) for 90 minutes at room temperature before being 
stopped with 500jJ SM buffer (50mM Tris pH 7.5, lOOmM NaCl, 8mM MgS0 4 and 0.01% 
gelatin) and 20jli1 of chloroform. The packaged lambda vectors were plated on E. coli XLl-Blue 
MRF' host cells. 

H. Dideoxynucleotide Screening. 

Mutant polymerase libraries were screened using a variation of the technique used by 

Sagner et al (Sagner, G. ? Ruger, R., and Kessler, C. (1991) Gene 97:1 19-123). Lambda phage 
clones were plated at a density of 10-20 plaques per square centimeter. Proteins present in the 
plaques were transferred to filters and moistened with polymerase screening buffer (50mM Tris 
(pH 8.0), 7mM MgCk, 3mM p-ME). The filters were kept between layers of plastic wrap and 
glass while the host cell proteins were heat-inactivated by incubation at 65°C for 30 minutes. The 
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heat-treated filters were transferred to fresh plastic wrap and approximately 35|il of the 
polymerase assay cocktail was added for every square centimeter of filter. Polymerase assay 
cocktail consisted of lx cloned Pfu magnesium-free buffer (Stratagene catalog #200534), 
125ng/ml activated calf thymus or salmon sperm DNA, 1.29 nCi/ml a-^P ddNTP (Amersham), 
and 0.5mM MnC^. Initial screening was done in the presence of MnCk, but the preferred 
method was to screen in IX Taq Polymerase buffer (1.5 mM MgCb). The filters were 
sandwiched between plastic wrap and glass again and incubated at 65°C for one hour, and then at 
70°C for one hour and 15 minutes. The filters were washed three times in 2x SSC for five 
minutes each time before being rinsed twice in 100% ethanol and dried on a vacuum dryer. The 
filters were exposed to X-ray film for approximately 16 hours. Plaques corresponding to strong 
signals were cored and placed in SM buffer. The positive primary plaques were replated at more 
dilute concentrations and assayed under essentially similar conditions to allow the purification of 
isolated plaques. 

Dve-dideoxvnucleotide Screening 

To detect mutant polymerases with improved capacity for dye-deoxynucleotide and dye- 
dideoxynucleotide utilization, the JDF-3 mutant DNA polymerase library was screened as 
described previously with the following exceptions: 

Polymerase assay cocktail for Flu-12-dUTP screening: 

0.9X Taq Buffer (Stratagene Catalog #200435), 65 \M dATP, 65 |uM dCTP, 65 jaM 
dGTP, 65 |LiMdTTP 5 0.3 \iM Fluoresceince-12-dUTP (Stratagene in-house production), 0.75 
jag/jLil activated calf thymus DNA. 

Polymerase assay cocktail for ROX ddNTP 

IX Taq Buffer, 0.9 [iM dATP, 0.9 jaM dCTP, 0.9 jaM dGTP, 0.9^1 TTP, 0.6 ROX 
ddATP (New England Nuclear (NEN) NEN478), 0.06 \xM ROX ddGTP (NEN NEL479), 0.06 

-63- 



|jM ROX ddCTP (NEN NEL477), 0.06 yM ROX ddUTP (NEN NEL476), 0.84 jag/jal activated 
calf thymus DNA. (Note: A screening system without ROX ddGTP is the preferred method since 
DNA polymerases do not discriminate against ddGTP). 

Polymerase assay cocktail for Fluroesceine ddUTP 

IX Taq Buffer, 70 \M dATP, 70 jiM dTTP, 70 \M dCTP, 15 \M dTTP, 1 yM 
Fluroesceine-12-ddUTP (NENNEL401), 0.84 jig/^il activated calf thymus DNA. 
Antibody binding to fluroesceine 

The filters were blocked overnight with 1% non-fat dry milk dissolved in TBST (50 mM 
Tris pH 8.0, 150 mM NaCl, 0.05% Tween-20) at 4°C. The filters were washed briefly in TBST 
before alkaline phosphatase conjugated anti-fluoresceine antibody from the Illuminator kit 
(Stratagene catalog #300360) was added at a 1/10,000 dilution in 50 ml TBST. The antibody 
was detected with NBT/BCIP at concentrations of 0.3 mg/ml and 0.15 mg/ml respectively in a 
buffer composed of 100 mM Tris pH 9.5, 100 mM NaCl, and 5 mM MgCl 2 . 

Antibody binding to Rhodamine 

Anti-ROX antibody (Zymed cat. no. 71-3600 rabbit Rhodamine (5-ROX polyclonal, 1 
mg/ml)) was diluted to 1:1000 in TBST. The blocked filters were blotted briefly to remove 
excess moisture then laid on plastic wrap and covered with 2.5 ml of the diluted antibody 
solution. An additional sheet of plastic wrap was laid over the filters before incubation at room 
temperature for 1 hour. The filters were washed briefly three times with TBST, then washed 
three times with gentle agitation for 15 minutes each time. The washed filters were incubated 
with alkaline phosphatase conjugated goat anti-rabbit antibodies diluted 1 :5000 in TBST. The 
filters were incubated with the antibody for one hour then detected with NBT/BCIP as described 
previously. 
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I. Dideoxynucleotide Qualification 

Lambda phage clones which incorporated 33 P-labeled ddNTPs in the primary library 

screen were re-screened to verify polymerase activity and to assess the contribution of the 
divalent metal ion to 33 P-ddNTP incorporation. The clones selected during this round of 
screening were designated as p#. These clones all contained an amino-terminal tag, as discussed 
in the section entitled "Forward Primers". Figure 5 shows that clones pi, p2, p3, p6, p7, p8, p9, 
plO, pi 1, pi 2, pi 4, pi 5, and pi 6 exhibited wild type levels of DNA polymerase activity, based 
upon similarity in signal strength to the parental #550 clone (Figure 5, panel 3). Although initial 
screening was carried out in the presence of 0.5mM MnCk, all of the clones except p9 and plO 
% were able to incorporate 33 P-labeled ddNTPs to at least some extent in the presence of 1 .5mM 
3 MgCl 2 (panel 2), with clones p2, p4, p8, pi 1, pl2, pl3, pl4, pl5, pl7 ? and pl8 producing the 
l| highest signals. 

^ Eighteen mutants were chosen for evaluation. One microliter of phage isolated from each 

f purified plaque was placed on each of three E. coli XL 1 -Blue MRF' lawns. Phage containing a 
iJ . parental copy of exo" JDF3 DNA (#550 clone) were also spotted on the grid. The plaques formed 
% by the phage were transferred to filters and treated as described in the preceding screening 

section with the exception of the final buffer composition. The buffers used for each filter (filters 

1-3) are as follows: 

Filter 1 : Dideoxynucleotide screen with manganese chloride 

lx Taq DNA polymerase magnesium-free buffer 

1.28 [iCi/ml 33 P ddNTPs 

0.5 H'g/P'l Activated Calf Thymus DNA (Sigma) 
0.5 mM MnCl 2 

Filter 2: Dideoxynucleotide screen with magnesium chloride 

lx Taq DNA polymerase buffer (containing 1 .5mM MgC^, catalog #200435) 
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1.28 ^Ci /ml j3 P ddNTP 

0.5 jLXg/jLxl Activated Calf Thymus DNA (Sigma) 

Filter 3: Deoxvnucleotide screen with magnesium chloride 

1 x Taq DNA polymerase buffer 

0.072 mMdGTP,dCTPandTTP 

40 ixM dATP 

0.5 |Lig/ml Activated Calf Thymus DNA (Sigma) 

0.01 jxCi a- 33 P dATP. 

Results are shown in Figure 5. 
Dve-dideoxvnucleotide Qualification 

As described in the previous segments, primary lambda clones were spotted on an E. coli 
lawn and re-screened with the appropriate antibody or antibodies. 

J. Excision of Lambda Clones, 

When incubated with helper phage under suitable conditions, Lambda Zap™ vectors are 

designed to produce phagemid copies of the part of the vector containing pBluescript (SK-) and 
the insert. This process yields a plasmid (pBluescript SK-) vector carrying the same insert that 
was contained in the lambda clone. Excision of clones with the desired phenotype was carried 
out according to the instructions in the EXASSIST™ system (Stratagene catalog #200253). 

K. Sequence Analysis of Mutants. 

The mutants were sequenced by Sequetech Corporation(Mountain View, California) 

using the following primers: 

Primer 3 (or primer G) 5' CCAGCTTTCCAGACTAGTCGGCCAAGGCC 3' 
Primer 5 (or JDF3-1 128) 5 ? AACTCTCGACCCGCTG 3' 
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L. Dideoxynucleotide mutagenesis* 

To conclusively identify the amino acids contributing to reduced ddNTP discrimination, 

individual point mutations were introduced into the exo" JDF-3 #550 clone using the 
QUIKCHANGE™ Site-Directed Mutagenesis Kit (Stratagene catalog #200518). The following 
mutants were prepared: L408H, L408F, P410L, A485T, S345P, D373Y, A619V, and L631 V. In 
addition, a double mutant (P410L/A485T) was constructed by introducing the A485T mutation 
into the exo" JDF-3 P410L mutant clone. To completely eliminate all 3' to 5' exonuclease 
activity, the mutation D141A was added to all clones. A pre-existing 5 5 to 3' exonuclease 
mutation (E143A) was present in the parental template JDF-3 550. 
Dve-dideoxvnucleotide mutagenesis 

To conclusively identify amino acids responsible for contributing to reduced 
discrimination of dye nucleotides, the mutation S345P was generated alone and in combination 
with the P410L and P410L + A485T. 

M. Preparation of heat-treated bacterial extracts. 

E. coli SOLR cells containing the excised plasmid were grown overnight at 37°C. The 

cells contained in 500jj,1 of culture were collected by microcentrifugation. The cell pellets were 
resuspended in 50jil of 50mM Tris (pH 8.0). Lysozyme was added to a final concentration of 
1 [ig/fil, and the cells were lysed during a 10 minute incubation at 37°C, followed by 10 minutes 
at 65°C. The heat-inactivated cell material was collected by microcentrifugation and the 
supernatants were assayed for dNTP and ddNTP incorporation as described below. 

N. Purification of JDF-3 and JDF-3 polymerase mutants. 

One method for purifying exo" JDF-3 DNA polymerase involves ammonium sulfate 

fractionation, followed by Q Sepharose and DNA cellulose columns. A second method has been 
developed to allow rapid purification of JDF-3 polymerase mutants, and entails adsorption of 
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contaminants on a HiTrap Q column, followed by gradient elution from a HiTrap heparin 
column (section iii). 

i. Preparation of bacterial lysate. 

Frozen cell paste (3-14 grams) was resuspended with 3x volume of lysis buffer, 
consisting of 50mM Tris-HCl (pH 8.0), 1 mM EDTA, and 10 mM (3-mercaptoethanoL 
Lysozyme was added to 0.2mg/ml and PMSF was added to ImM final concentration. The cells 
were lysed on ice over a period of 1 hour. The lysate was then sonicated for 2 minutes (90% 
duty, level of 2x2.5, 1x3.0). Following sonication, the lysate was heated at 65°C for 15 minutes 
to denature bacterial proteins. The heated lysate was then centrifuged for 30 minutes at 14.5K 
rpm in a Sorvall RC-2B centrifuge using a Sorvall SS-34 rotor, and the supernatant was 
recovered. 

ii. Ammonium sulfate fractionation and Q Sepharose/DNA cellulose chromatography (method 
1) 

Ammonium sulfate was added to the bacterial lysate to a final concentration of 45%. The 
ammonium sulfate was added over a period of 15 minutes, and the mixture was stirred for an 
additional 30 minutes. The mixture was centrifuged as described above, and the supernatant was 
recovered. Additional ammonium sulfate was then added to bring the final concentration to 65%. 
The mixture was centrifuged as described above, and the supernatant removed. The pellet was 
resuspended in 10ml of buffer A consisting of 50mM Tris-HCl (pH 7.5), 1 mM EDTA, lOmM p- 
mercaptoethanol, 0.1% (v/v) Tween 20, and 10% (v/v) glycerol. The supernatant was dialyzed 
overnight against 2 changes of buffer A (3 liters each). 

The dialysate was loaded onto a 2.6 x 9.4cm Q-Sepharose Fast Flow column ( 50mls), 
pre-equilibrated in buffer A. The column was washed with buffer A until the absorbence (OD280) 
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approached baseline. The column was then eluted with a gradient from 0 to 1M NaCl/buffer A. 
Fractions were collected, and analyzed by SDS-PAGE and DNA polymerase activity assays (see 
below). Active protein typically eluted between 130 and 240mM NaCl. Active fractions were 
pooled and dialyzed overnight against 2 changes of buffer B (3 liters each), consisting of 50mM 
Tris-HCl (pH 7.5), 1 mM EDTA, lOmM (3-mercaptoethanol, 0.1% (v/v) Tween 20, 
10%(v/v)glycerol, and 50mM NaCl. 

The Q-Sepharose eluate was then loaded onto a 1.6 x 4.9cm ( lOmls) DNA cellulose 
column, equilibrated in buffer B. The column was washed with buffer B until the absorbence 
(OD280) approached baseline. The column was then eluted with a gradient from 50 to lOOOmM 
NaCl/buffer A. Fractions were collected, and analyzed by SDS-PAGE and DNA polymerase 
activity assays. Active protein typically eluted between 280 and 360mM NaCl. Active fractions 
were pooled and dialyzed overnight against JDF-3 final dialysis buffer, consisting of 25mM Tris- 
HCl (pH 7.5), lOOmM KC1, O.lmM EDTA, ImM DTT, 0.1% (v/v) Tween 20, 0.1% (v/v) Igepal 
630, 10ng/ml BSA, and 50% (v/v) glycerol, 
iii. HiTrap Q/HiTrap heparin chromatography (method 2) 

The preferable method for rapid purification of multiple mutants is as follows. Bacterial 
cell ly sates were prepared as described for method 1, except that Tween 20 and Igepal CA 630 
were added to a final concentration of 0.0 1 % (v/v) just prior to the heat denaturation step, and a 
heat denaturation temperature of 72 °C was used. 

The lysate was loaded onto a 1 .6 x 2.5cm ( 5mls) HiTrap Q column (pre-packed column 
from Pharmacia), pre-equilibrated in buffer C consisting of 50mM Tris-HCl (pH 8.2), ImM 
EDTA, lOmM (3-mercaptoethanol, 0.1% (v/v) Tween 20, and 0.1% (v/v) Igepal CA 630. The 
column was washed with buffer C until the absorbence (OD280) approached baseline. The flow 
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through fractions (OD280 absorbence above background) were collected and then loaded onto a 
1 .6 x 2.5cm ( 5mls) HiTrap heparin column (pre-packed column from Pharmacia), pre- 
equilibrated in buffer D consisting of 50mM Tris-HCl (pH 8.2), ImM EDTA, ImM DTT, 0.1% 
(v/v) Tween 20, 0.1% (v/v) Igepal CA 630, and 10% glycerol (v/v). The column was washed 
with buffer D until the absorbence (OD280) approached baseline. The column was then eluted 
with a gradient from 0 to 1M KCl/buffer D. Fractions were collected, and analyzed by SDS- 
PAGE and DNA polymerase activity assays. Active protein typically eluted between 390 and 
560mM NaCl. Active fractions were pooled and dialyzed overnight against JDF-3 final dialysis 
buffer (see above). Purified polymerases were stored at -20 C. 
iv. Analysis of Purified Proteins 

The concentrations of JDF-3 and mutant DNA polymerases were determined relative to a 
BSA standard (Pierce), using Pierce's Coumassie Blue Protein assay reagent. In addition, the 
purity and relative protein concentrations of different polymerase preparations were verified by 
SDS-PAGE. Polymerase samples were electrophoresed on 4-20% Tris-glycine gels (Novex), and 
the gels were silver-stained using standard procedures. 

O. Nucleotide Incorporation Assay. 

DNA polymerase activity was measured using purified JDF-3 polymerase mutants or 

heat-treated bacterial extracts prepared from various mutant clones. DNA polymerase activity 
was measured by monitoring the incorporation of 3 H-TTP into activated calf thymus DNA. A 
typical DNA polymerase reaction cocktail contained: 

lOmM Tris-HCl, pH 8.8 

1.5mM MgCl 2 

50mM KC1 

0.001% gelatin 
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200]iM each dATP, dCTP, dGTP 
195juM TTP 

5^M [ 3 H]TTP (NEN #NET-221H, 20.5Ci/mmole; partially evaporated to remove 
EtOH) 

250jig/ml of activated calf thymus DNA (e.g., Pharmacia #27-4575-01) 
Incorporation was measured by adding 1 jllI of polymerase samples to lOfxl aliquots of 
polymerase cocktail DNA polymerase samples were diluted in a suitable storage buffer (e.g., 
25mM Tris-HCl (pH 7.5), lOOmM KC1, O.lmM EDTA, ImM DTT, 0.1% (v/v) Tween 20, 0.1% 
(v/v) Igepal 630, 10ju,g/ml BSA, and 50% (v/v) glycerol). Polymerization reactions were 
conducted for 30 minutes at 72°C. Extension reactions were quenched on ice, and then 5jil 
aliquots were spotted immediately onto DE81 ion-exchange filters (2.3cm; Whatman #3658323). 
Unincorporated [ 3 H]TTP was removed by 6 washes with 2xSCC (0.3M NaCl, 30mM sodium 
citrate, pH 7.0), followed by a brief wash with 100% ethanol. Incorporated radioactivity was 
measured by scintillation counting. Reactions that lacked enzyme were also set up along with 
sample incubations to determine "total cpms" (omit filter wash steps) and "minimum 
cpms"(wash filters as above). 

Cpms bound is proportional to amount of polymerase activity present per volume of 
bacterial extract. The volume of bacterial extract (0.25- ljal) which brought about incorporation 
of approximately 10,000 cpms was determined for use in subsequent nucleotide analog 
incorporation testing. 

P. Quantitating ddNTP incorporation efficiency. 

JDF-3 polymerase mutants were evaluated to assess relative ddNTP incorporation 

efficiency. Nucleotide incorporation was measured in the presence of varying concentrations of 
each ddNTP terminator (ddATP, ddCTP, ddGTP, and ddTTP). Since ddNTP incorporation 
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produces non-extendable termini, polymerization is strongly inhibited for polymerases that 
incorporate ddNTPs efficiently. Comparisons of ddNTP concentrations that bring about 50% 
inhibition of nucleotide incorporation (150%) can be used to compare ddNTP incorporation 
efficiency of different polymerases or polymerase mutants. Comparisons of 150% values for 
ddATP, ddCTP, ddGTP, and ddTTP can be used to identify mutants with reduced selectivity for 
particular bases. Such mutants would be expected to produce more uniform DNA sequencing 
ladders. 

To measure incorporation of individual ddNTPs, cocktails were prepared which consisted 
of varying concentrations of the ddNTP of interest, and a total of 200juM of each nucleotide 
triphosphate. For example, the incorporation of ddATP by wild type JDF-3 polymerase was 
measured at 0, 40, 80, 120 and 160jliM ddATP. In these reactions, dATP concentrations were 
adjusted to 200, 160, 120, 80, and 40]uM, respectively, so that the total amount of adenine 
nucleotide triphosphate was 200jnM. In comparison, mutants were assayed using ddATP 
concentrations of 0, 5, 10, and 20pM ddATP, and adjusted dATP concentrations of 200, 195, 
190, and 180|uM, respectively (dATP + ddATP = 200juM). Additional cocktails were prepared to 
measure ddCTP, ddGTP, and ddTTP incorporation. To assess ddNTP incorporation by JDF-3 
mutants at 3 different ddNTP concentrations, 12 reaction cocktails were prepared consisting of: 

lOmMTris-HCl, pH 8.8 

L5mM MgCl 2 

50mM KC1 

0.001% gelatin 

5pM [ 3 H]TTP (NEN #NET-221H, 20.5Ci/mmole; partially evaporated to remove EtOH) 
250jag/ml of activated calf thymus DNA (e.g., Pharmacia #27-4575-01) 
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To each of 12 reaction cocktails was added the appropriate amounts of dNTPs and 
ddNTPs as summarized below: 
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Incorporation was measured by adding ljul of appropriately diluted bacterial extract 
(10,000 cpms) to lOjil of each polymerase cocktail. Polymerization reactions were conducted for 
30 minutes at 72°C. The extension reactions were counted as described above. 

Reactions that lacked enzyme were also set up along with sample incubations to 
determine "minimum cpms"(wash filters as above). To determine % activity as a function of 
ddNTP concentration, background ("minimum cpms" value) was first subtracted from each of 
the sample cpms. "Total cpms", which are equivalent to 100% activity (0 ddNTPs), are 
determined by averaging the corrected cpms for the 4 reactions lacking ddNTPs (A-0, G-0, C-0, 
and T-0). Percent remaining activity was then calculated by dividing corrected sample cpms 
(with ddNTPs) by the corrected total cpms (average 0 ddNTPs). 

Percent activity was plotted as a function of ddNTP concentration. I 5 o% values for each 
ddNTP (ddNTP concentration which inhibits nucleotide incorporation by 50%) were determined 
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for each mutant. Comparisons allowed the identification of mutants with improved ddNTP 
incorporation relative to wild type JDF-3. 

Initial studies used purified enzymes, and l$o% values were determined from inhibition 
plots employing 40-160^iM ddNTPs. The results in Table V show that mutants p8 (P410L), pi 1 
(P410L), and pl2(A485T) are inhibited by lower concentrations of ddNTPs than the parental 
exo" JDF-3 polymerase. Greater sensitivity indicates that the mutants incorporate all four 
ddNTPs more efficiently than the original JDF-3 polymerase. 

For enzymes which preferentially incorporate TTP over ddTTP (exo" JDF-3 , exo" Pfu), 
the use of increasingly higher concentrations of ddTTP (80-160]uM) and correspondingly lower 

; concentrations of TTP (1 15-35jjM) ? in combination with a constant amount of [ H]TTP (5|iM), 

leads to an increase in cpms incorporated with increasing ddNTP concentration. Therefore, in 

2 these initial experiments (where ddTTP >120{iM) ? lso% values for TTP are artificially high. 
While they can be used to compare ddTTP incorporation among different polymerase mutants, 

y they can not be used to assess reduced/enhanced preference for ddTTP relative to ddCTP, 

3 ddGTP,orddATP. 

Table V I 50 % Values for Purified JDF-3 and JDF-3 Mutants. 

Purified Polymerase Primary I 50 % Values (jxM) 

Mutation 

ddATP ddGTP ddCTP ddTTP 
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To allow a larger number of mutant clones to be screened, subsequent experiments 
employed bacterial extracts containing JDF-3 polymerase mutants. In addition, sensitivity was 
improved by using lower concentrations of each ddNTP inhibitor (5-20|nM). The results in Table 
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VI demonstrate that all of the mutants selected from the primary filter screen exhibited improved 
incorporation of ddNTPs. Improvements in ddNTP incorporation were as high as >20-fold. All 
of the mutants containing a mutation at amino acid 408 (L408H/F), 410 (P410L), or 485 
(A485T) (referred to as the "primary mutation") exhibited reduced discrimination against all four 
ddNTPs. Most, but not all, mutants with the L408H/F primary mutation produced very similar 
Iso% values (<2-fold difference) for all four ddNTPs, indicating that base selectivity is diminished 
or absent. 

Table VI. I 5 o% Values for JDF-3 Mutants (Bacterial Extracts). 

I 50% Values (jiM) 
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1-26, 1-32, 1-16, 1-12 
Q. Sequencing with purified JDF-3 polymerase mutants. 

i. Sequencing with radioactively labeled dideoxynucleotides 

1 to 2jul of purified enzyme was substituted into the Thermo Sequenase radiolabeled 
terminator cycle sequencing kit (Amersham-Pharmacia #US79750). The samples were 
processed according to the manufacturer's instructions using the control primer and template 
provided with the kit. Three microliters of each sequencing reaction were loaded onto a 6% 
acylamide-7M urea, lx TBE CASTAWAY™ Precast gel (Stratagene catalog #s 401090 and 
401094). When the bromophenol blue indicator dye reached the end of the gel, the gel was 
fixed, dried and exposed to film for 24-72 hours (Figure 6). 
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The results in Figure 6 show that clones pi 1 (panel D) and p8 (panel E) exhibit a 
dramatic improvement in the incorporation of all four ddNTPs compared to the parental #550 
clone (panel B). Mutants pi 1 and p8 both contain the primary P410L mutation and an amino tag, 
but differ with respect to the number and types of ancillary mutations. Mutant pl2 (panel C) 
produced a faint sequencing ladder, presumably due to the use of an insufficient amount of 
enzyme or the presence of ancillary mutations which reduce thermal stability. There is evidence 
of termination products in all lanes, suggesting an improvement in the incorporation of all four 
ddNTPs relative to the parental clone. Mutant pl2 contains the primary mutation A485T in 
addition to ancillary mutations. In contrast to JDF-3 mutants identified here, the parental clone 
shows a strong preference to incorporate ddGTP, as evidenced both in primer extension (Figure 
6) and ddNTP inhibition assays (Tables V and VI). 

ii. Sequencing with a radioactively labeled primer and fluorescent dideoxynucleotides 

Different DNA polymerases and polymerase mutants will exhibit varying degrees of 
discrimination against the dye moieties on the dideoxynucleotide analogs. An assessment of 
usage of dye-labeled dideoxynucleotide analogs by the JDF-3 polymerase mutants was carried 
out. The procedure used was as follows; 
a. Primer Labeling 

The sequencing primer SK was radioactively labeled with the KINACE-IT™ Kinasing 
Kit (Stratagene catalog #200390). The incubation reaction (40jnl) contained the following 
components: 

IX kinase buffer #1 

0.75 jmCi/^il y- 33 P ATP 

0.375 u/jLxl T4 polynucleotide kinase 

2.5 pmol/|ul SK primer 
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The reaction was incubated at 37°C for 45 minutes. The primer was purified away from 
free nucleotides with a size exclusion matrix (NUC TRAP® Stratagene catalog number 400701). 

b. Dye labeled-dideoxynucleotide : dNTP ratios 

Fluorescent dideoxynucleotides were purchased from New England Nuclear (NEN); 

R6G-ddATP NEN catalog number NEL-490 

Rl 1 0-ddTP NEN catalog number NEL-495 

TAMRA-ddUTP NEN catalog number NEL-472 

ROX-ddCTP NEN catalog number NEL-477 

Incorporation was measured using 3 different concentrations of dye labeled 

dideoxynucleotides (ddNTPs) and a constant amount of deoxynucleotides (dNTPs; 
2.14pM): 

Condition 3) 1 : 1 (2-14pM each dNTP : 2.14pM dye-labeled ddNTP) 

Condition 2) 1 : 0.1 (2.14jiM each dNTP : 0.214pM dye-labeled ddNTP) 

Condition 1) 1 : 0.01 (2.14pM each dNTP : 0.0214]uM dye-labeled ddNTP) 

c. Preparation of the DNA Sequencing Reaction Mixtures 

Four polymerases were tested for utilization of dye-labeled ddNTPs, exo" JDF-3 (#550 
clone), Thermo Sequenase (4u/|ul) ? JDF-3 P410L (clone p8 with ancillary mutations and an 
amino-terminal tag) and JDF-3 L408H (clone 1-1). A mixture containing the following 
reagents was assembled: 

13.7^1 H 2 0 
1 jil labeled SK primer (2 pmol/jil) 
1 jil pBluescript KS (0.2fig/[il) 
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1 ul polymerase (~1 .5u/ 

2ul I0X buffer (reaction buffer 1 for all but L408H which uses 1 .5mM MgCl 2 , buffer (see 
below) 

IPX Reaction Buffer 1 
260mMTrispH9.5 
65mM MgCl 2 

IPX 1.5mMMgCh buffer 
24 mMMgCh 

26P mM Tris pH 9.5 

2.5 jul of each dye-labeled ddNTP terminator (ddGTP, ddATP, ddTTP and 
ddGTP was aliquotted separately into one of four tubes. 4.5ul of each polymerase 
reaction was added to each of the four tubes, to give a final reaction volume of 7ul. 
d. Cycle Sequencing Reactions 

The samples were cycled in a RoboCycler®96 Temperature Cycler with 
a Hot Top Assembly (Stratagene Catalog #4PP87P and #4PP894) using the following conditions: 

1) 1 minute at 95°C 

2) 1 minute at 95°C 

3) 1 minute at 5P°C 

4) 2 minutes at 72 °C 

5) Repeat steps 2-4 thirty times. 

4 ul of stop solution (95% formamide, 2PmM EDTA, P.P5% bromophenol blue, P.P5% 
xylene cyanol FF) was added to each of the amplified reactions before heating them to 99°C for 
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five minutes. The samples were electrophoresed on a 6% CASTAWAY™ gel as described 
above. The gels were dried and then exposed to film for 72 hours (Figure 7). 

The results of studies designed to assess utilization of dye-labeled ddNTPs by the different 
polymerase clones are shown in Figure 7. Clones p8 (panel C) and 1-1 (panel D) exhibited 
significantly improved incorporation of R6G-ddATP and Rl 10-ddGTP, compared to the parental 
clone (panel A). Improvement was evidenced by the synthesis of sequencing ladders at .Olx (1) 
and O.lx (2) dye-ddNTP/dNTP ratios. Optimization of reaction conditions and/or dye moieties 
may be performed to realize improvements in the incorporation of ddTTP and ddCTP. 

2 iii. Sequencing with double-mutant exo" JDF-3 DNA polymerase. 

To verify that changes at residues 408, 410, and 485 were sufficient to improve ddNTP 
0 incorporation, individual mutations were introduced into the parental 550 (JDF-3 exo" DNA 
7 polymerase) clone by site-directed mutagenesis. In addition, point mutations were combined to 
, examine whether they resulted in further improvements in dideoxynucleotide incorporation over 
u polymerases bearing single mutations. 

3 DNA sequencing reactions consisting of lx reaction buffer, 0.15pmol/)il long -20 primer, and 
lOng/jug pBluescript KS were prepared as follows: 

81|il H 2 0 

9jli1 -20 long primer (2pmol/|jl) 

6]al pBluescript KS (0.2pg/jui) 

**|ul polymerase 

12jul 10X buffer (260mM Tris pH 9.5, 65mM MgCl 2 ) 

18|nl of the cocktail listed above was aliquotted into the appropriate number of tubes 
(one per polymerase). Each polymerase (2 pi) was added to an aliquot of cocktail and the tubes 
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were mixed well. Each resulting polymerase mixture (4.5 pi) was then added to each of four 
tubes, already containing 0.06mM of one of the four - 33 P-dideoxynucleotides (ddATP, ddTTP, 
ddGTP or ddTTP; 1500Ci/mmol; 450^Ci/ml) and 6mM each deoxynucleotide in a volume of 
2.5|iL 

The sequencing reactions were cycled in a ROBOCYCLER®96 temperature cycler 
with a Hot Top Assembly using the following conditions: 

1) 1 minute at 95°C 

2) 45 seconds at 95°C 

3) 45 seconds at 60°C 

4) 1.5 minutes at 72°C 

5) Repeat steps 2-4 thirty times. 

Stop solution (pi; 95% formamide, 20mM EDTA, 0.05% bromophenol blue, 0.05% 
xylene cyanol FF) was added to each reaction before heating to 99°C for five minutes. Each 
sample (4jul) was loaded onto a 6% acrylamide denaturing CastAway gel. The gel was run and 
treated as described previously. 

Figure 8 shows that the P410L/A485T double mutant exhibits exceptionally even 
signals. Band uniformity was improved compared to mutant p8 (P410L mutation plus ancillary 
mutations that do not include A485T) and mutant A485T (data not shown). Mutant p8 exhibited 
a tendency to preferentially incorporate ddGTP and ddCTP in a sequence-dependent fashion. 
The optimal amount of enzyme may be higher than the quantity tested in this experiment. 
Sequence produced by the commercially available Family A DNA polymerase mutant, Thermo 
Sequenase, is shown in panel E. 
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iv. Ribonucleotide incorporation by JDF-3 polymerase mutants. 

A primer annealed to single stranded DNA template was extended in a mixture 
containing all ribonucleotides or all deoxynucleotides with the mutant and progenitor 
polymerases. 

M13mpl8+ single stranded DNA was annealed to 95x molar excess of the 38mer primer 
by heating the mixture to 95°C and cooling slowly at room temperature. 
3 8mer primer: 5 5 GGTTTTCCCAGTC ACGACGTTGTAAAACGACGGCCAGT 3 ' 

Preliminary assays were carried out to determine what dilutions of enzyme would be 

1 necessary to examine the incorporation activity at non-maximal levels. The final assay solutions 

were composed as described below: 

Ribonucleotide mixture 
J 20 ng/jil annealed primer/template 

* 1 x Cloned Pfu buffer (Stratagene catalog #200532) 

J 200^iMeach GTP,UTP,ATP 

j 50^iM CTP 

J 1 jaM 5- 3 H CTP 20.2 Ci/mmole 

0.05-0.3 units JDF-3 polymerase* 

Deoxvribonucleotide mixture 

20 ng/pl annealed primer template 

Ix Cloned Pfu buffer 

200(iM each dGTP, dATP 5 dCTP 

50jiM TTP (deoxyribonucleotide) 

IjiM Thymidine 5 '-triphosphate, [methyl- 3 H] 20.5 Ci/mmole 

0.05-0.3 units JDF-3 polymerase* 
* Added separately 
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Nine microliters of the polymerase-free mixtures were placed in 0.2ml tubes before the 
polymerases were added. The samples were incubated at 72°C in a ROBOCYCLER®96 
temperature cycler with Hot Top Assembly (Stratagene Catalog Nos. 400870 and 400894). The 
deoxyribonucleotide mixture was removed at 2 minutes and placed at approximately 2°C. The 
ribonucleotide mixture was incubated for 30 minutes. Seven microliters of the assay mixture 
were spotted onto DE81 filter circles (Whatmann) and dried prior to being washed three times in 
2x SSC (0.3M NaCl, 0.03M sodium citrate) for five minutes each wash. The filters were rinsed 
twice in ethanol and allowed to dry before being quantified with a scintillation counter. 

Background counts per minute (CPM) for the deoxyribonucleotide and the ribonucleotide 
reactions were subtracted from the respective averaged CPM value of duplicate samples for each 
enzyme. The background-corrected ribonucleotide CPM value was divided by the background- 
corrected deoxyribonucleotide CPM value (Figure 9). 



Polymerase Ratio NTP/dNTP Relative to JDF-3 

550 



JDF-3 550 0.000165162 1 

JDF-3 L408H 0.041087258 249 

JDF-3 L408F 0.051703924 313 

JDF-3 A485T 0.007628583 46 

v. Ribonucleotide sequencing with JDF-3 polymerase mutants. 

Ribonucleotides incorporated into a deoxyribonucleotide polymer are susceptible to 
alkali hydrolysis which can produce a sub-population of polymer lengths. When labeled primer 
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is extended in the presence of a particular ribonucleotide base (for example ATP) and the four 
deoxyribonucleotide bases, the fragments resulting from alkali hydrolysis create a population of 
different lengths, which correspond to all the possible positions where ATP was incorporated. 
When those fragments are size separated, their migration pattern, with respect to other 
ribonucleotide base (CTP, UTP and GTP) hydrolysis products allows the template sequence to 
be read. As described previously, most DNA polymerases discriminate against non-conventional 
deoxynucleotides. A subset of the JDF-3 DNA polymerase mutants which allow improved 
uptake of the unconventional dideoxynucleotides also show improved tolerance for 
ribonucleotide incorporation. 

lOOng of the 38mer primer was kinased with y- 33 P according to the instructions in the 
KINACE-IT™ Kinasing Kit (Stratagene catalog #300390). 

38mer primer: 5' GGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGT 3 ' 

The labeled oligonucleotide was purified from contaminating free nucleotides with a 
NUC TRAP® Probe Purification Column (Statagene catalog #400701) in 10T.1E (lOmM Tris 
pH 8.0, O.lmM EDTA). Labeled oligonucleotide (-7 picomoles) was annealed to 0.09 pmoles 
M13mpl8+ by heating to 95°C then cooling to room temperature in the presence of 0.32mM 
MgCl 2 . 

Extension components 

0.054 pM annealed primer/template 

200 DMeach dNTP 

Ix cPfu DNA polymerase buffer (Stratagene catalog #200532) 

4-200 ATP* 

0.1-5 Units JDF-3 polymerase* 
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* Added separately 

Eight microliters of a cocktail containing the first three components listed above were 
aliquoted into a 0.2ml tube, ljul of polymerase and 1 jil of 2mM, 0.2mM or 0.4mM ATP were 
added and the reaction was incubated at 72°C for 15 minutes. The reaction volume was brought 
to lOOpl with lx cPfo polymerase buffer and transferred to a 1.5ml tube. After heating the 
reactions in the presence of 70mM NaOH for 15 minutes at 100°C, the reaction was neutralized 
with 70mM HC1 and precipitated through the addition of lOjil 3M sodium acetate and 327.5jil of 
ethanol. The samples were microcentrifuged for 30 minutes at 14krpm before the supernatant 
was removed and the pellet washed in 80% ethanol. After vacuum drying, the samples were 
resuspended in 5\i\ of sequencing stop solution (95% formamide, 20mM EDTA, 0.05% 
bromophenol blue, 0.05% xylene cyanol FF) and 2.5jal was loaded on a 6% acylamide-7M urea, 
lx TBE CASTAWAY™ Precast gel (Stratagene catalog numbers 401090 and 401094). The gels 
were run at 50 watts until the bromophenol blue dye migrated past the bottom of the gel after 
which the gel was fixed, dried and exposed to film for 72 hours. 

Sequencing ladders for JDF-3 550 (wild-type nucleotide incorporation) and all the 
mutants tested were visible at the 200juM and 20pM ATP level. At the 4jiM level, only the 
L408H and L408F mutants produced ladders (data not shown), 
vi. Sequencing with dye-dideoxynucleotide terminators 

Primer was extended in the presence of FAM ddCTP (NENNEL481). The sequence 
reactions were purified and run on an ABI 370. 

Reaction conditions for cycle-sequencing were as described below: 

IX cPFU buffer, 200 ng pBluescript II KS plasmid, 3 pmole T7 primer, 0.23 mM dCTP, 
0.23 mM dATP, 0.23 mM dTTP, 0.23 mM dGTP with 0.046 mM FAM ddCTP. The samples 
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were cycled in a Perkin-Elmer cycler in 10 jliI volumes for 25 cycles of the temperatures and 

times described below: 

95°C 30s 
55°C 30s 
72°C 2 min 

The samples were purified using CentriSep columns according to the manufacturer's 
instructions. After drying, the samples were resuspended in 3 jllI of a loading dye comprised of 
66.7% deionized formamide, 16.7 mg/ml Blue Dextran, and 8.3 mM EDTA. Samples were 
heated at 95°C for three minutes and loaded on a 5% LongRangen gel in an ABI PRISM 377 
DNA sequencer. 

Data was processed in Gene Scan 2.1. 
EXAMPLE 2 . 
Labeling of DNA. 

The modified DNA polymerases of the invention are applicable to labeling of DNA. It is 
known to those skilled in the art that there are several means by which to label DNA, including 
the incorporation of radiolabeled nucleotides. One such common means is by random priming, 
which enables one of skill in the art to generate labeled DNA fragments, typically about 50 to 
about 1000 bases long. The procedure described herein are adapted from F. Ausubel et al., Short 
Protocols in Molecular Biology, Third Edition, John Wiley and Sons, Inc., 1995. 

As a first step toward random priming DNA, a reaction mix containing 2.5 microliters 0.5 
mM 3dNTP (dCTP, dGTP, TTP, each at 0.5 mM), 50 jiCi [ - 32 P]dATP, 1 microliter of 3 to 8 
units/microliter DNA polymerase in 50 mM Tris-HCl, pH 7.5, 10 mM MgCl 2? 1 mM 
dithiothreitol, 0.05 mg/ml bovine serum albumin is prepared in a total volume of 1 1 microliters 
and incubated on ice. Next, about 30 to about 100 ng of DNA is mixed with about with 1 to 5 |ug 
of random hexanucleotides in 14 microliters and boiled for 2 to 3 minutes and then placed on ice. 
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The 1 1 microliter reaction mix is then added to the DNA/random hexamer mix, and the random 
priming reaction is incubated over 10 minutes to as much as 4 hours at room temperature. To 
stop the reaction, 1 microliter 0.5 M EDTA, 3 microliters 10 mg/ml tRNA, and 100 microliters 
10 mM Tris-HCl, pH 7.4 is added and the mixture is extracted with phenol. The labeled DNA is 
then separated from unincorporated radioactive precursors by chromatography. 

R, Gel Assay for Dye-dideoxynucleotide Incorporation. 

A labeled oligonucleotide duplex was extended with a mixture of dideoxynucleotides and 
dye-dideoxynucleotides. When the duplex was separated on a denaturing 20% Acrylamide/7 M 
urea gel, labeled oligonucleotides terminated with a dideoxynucleotide could be resolved from 
oligonucleotides terminated with dye-deoxynucleotides. 

Oligonucleotides: 

259C 32 P-TAACGTTGGGGGGGGGCA~> 
258C TGCAACCCCCCCCCGTAT 

The 5' end of 259C was labeled and purified as described in Section Q.ii.a except that 32 Py- 
ATP was used. The labeled oligonucleotide 259C was at a concentration of approximately 0.7 
ng/pl The complimentary oligonucleotide (258C) was added as an equal concentration, heated 
to 95°C for three minutes, 50°C for 5 minutes and room temperature for 20 minutes. Heat killed 
ly sates of the relevant mutants were prepared as described in Example section C. The reactions 
were incubated in a 5 jxl volume composed of 30 mM Tris pH 8.0 and 3 mM MgCl 2 with a 
nucleotide mixture totaling 0.1 mM. The ratio of ddTTP to FLU ddUTP or ROXddUTP was 
10:1. The dimer was present at a concentration of 1 .2 picomoles and 0.5 jxl of enzyme or crude 
lysate or purified enzyme was added to the reaction before incubation at 50°C in the 
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RobeCycler® Gradient 96 Temperature Cycler with Hot Top. The samples were incubated for 
20s before 3 ul of a formamide based loading dye was added and the samples were heat- 
denatured at 95°C for 3 minutes then loaded onto a 20% acrylamide/7 M urea gel and subjected 
to electrophoresis at a constant 60 watts. The gel was exposed to X-ray film and the film was 
analyzed in the EagleEye® Eagle Sight software package. 
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OTHER EMBODIMENTS 
Other embodiments will be evident to those of skill in the art. It should be understood 

that the foregoing detailed description is provided for clarity only and is merely exemplary. The 

spirit and scope of the present invention are not limited to the above examples, but are 

encompassed by the following claims. 
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CLAIMS 



1 . A purified thermostable DNA polymerase having an amino acid sequence presented in 
SEQ ID NO: 2 from residue 1 to 776. 

2. The polymerase of claim 1 that is isolated from Thermococcus species JDF-3. 

3 . The polymerase of claim 1 that is isolated from a recombinant organism transformed with 
a vector that codes for the expression of Thermococcus species JDF-3 DNA polymerase. 

4. A recombinant vector comprising the nucleotide sequence presented in SEQ ID NO: 1 . 

5. An isolated recombinant polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2. 

6. An isolated recombinant DNA polymerase from Thermococcus species JDF-3 that is 3' 
to 5' exonuclease deficient. 

3 7. The isolated recombinant DNA polymerase of claim 6, wherein said polymerase has a 
3 dual mutation comprising a serine to proline mutation at a site corresponding to S345 of SEQ ID 
ri NO: 2; and a proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

0 8. The isolated recombinant DNA polymerase of claim 6 that has an aspartic acid to 

;f threonine or alanine mutation at the amino acid corresponding to D 141 of SEQ ID NO: 2 or a 
~ glutamic acid to alanine mutation at the amino acid corresponding to El 43 of SEQ ID NO: 2. 

9. The isolated recombinant DNA polymerase of claim 6 that has an aspartic acid to 
Z threonine or alanine mutation at the amino acid corresponding to D 141 of SEQ ID NO: 2 and a 
;i glutamic acid to alanine mutation at the amino acid corresponding to El 43 of SEQ ID NO: 2. 

1 1 0. An isolated recombinant DNA polymerase having reduced discrimination against non- 
l conventional nucleotides. 

1 1 . The DNA polymerase of claim 6 or 1 0 that is a Family B DNA polymerase. 

1 2. The DNA polymerase of claim 6 or 1 0 wherein said DNA polymerase further comprises 
a mutation selected from the group consisting of: a leucine to histidine mutation at a site 
corresponding to L408 of SEQ ID NO: 2; a leucine to phenylalanine mutation at a site 
corresponding to L408 of SEQ ID NO: 2; a proline to leucine mutation at a site corresponding to 
P410 of SEQ ID NO: 2; and an alanine to threonine mutation at a site corresponding to A485 of 
SEQ ID NO: 2. 

1 3 . The DNA polymerase of claim 1 2 wherein said polymerase having said alanine to 
threonine mutation at said site corresponding to A485 of SEQ ID NO:2 further comprises a 
mutation selected from the group consisting of: a leucine to histidine mutation at a site 
corresponding to L408 of SEQ ID NO: 2; a leucine to phenylalanine mutation at a site 
corresponding to L408 of SEQ ID NO: 2; a serine to proline mutation at a site corresponding to 
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S345 of SEQ ID NO: 2; and a proline to leucine mutation at a site corresponding to P410 of SEQ 
ID NO: 2. 

14. The DNA polymerase of claim 6 or 10 that has reduced discrimination against a non- 
conventional nucleotide selected from the group consisting of: dideoxynucleotides, 
ribonucleotides and conjugated nucleotides. 

15. The DNA polymerase of claim 14 wherein said conjugated nucleotide is selected from 
the group consisting of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled 
nucleotides, chemiluminescently labeled nucleotides and quantum dot labeled nucleotides. 

16. An isolated recombinant Family B DNA polymerase comprising an alanine to threonine 
mutation at the site corresponding to A485 of SEQ ID NO: 2, or a mutation at a site 
corresponding to L408, S345 or P410 of SEQ ID NO: 2, wherein said DNA polymerase has 
reduced discrimination against non-conventional nucleotides relative to the wild-type form of 
that polymerase. 

17. The polymerase of claim 16 that is 3' to 5' exonuclease deficient. 

18. The polymerase of claim 1 7 that has a mutation at an amino acid corresponding to D141 
orE143 of SEQ ID NO: 2. 

19. The polymerase of claim 17 that has an aspartic acid to threonine or alanine mutation at 
a site corresponding to D141 of SEQ ID NO: 2. 

20. The polymerase of claim 17 that has a glutamic acid to alanine mutation at a site 
corresponding to E143 of SEQ ID NO: 2. 

2 1 . The polymerase of claim 20 that has an aspartic acid to threonine or alanine mutation at 
the amino acid corresponding to D141 of SEQ ID NO: 2. 

22. The polymerase of claim 16 that is thermostable. 

23. The polymerase of claim 16 that is archaeal 

24. The polymerase of claim 16 wherein said DNA polymerase comprises a leucine to 
histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

25. The polymerase of claim 16 wherein said DNA polymerase comprises a leucine to 
phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

26. The polymerase of claim 16 wherein said DNA polymerase comprises a proline to 
leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

27. The polymerase of claim 16 wherein said DNA polymerase comprises a serine to proline 
mutation at a site corresponding to S345 of SEQ ID NO: 2, wherein said polymerase may further 
comprise a mutation at a site corresponding to T604 of SEQ ID NO: 2. 
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28. The polymerase of claim 16, wherein said DNA polymerase comprises a tyrosine to 
cysteine mutation at a site corresponding to Y497 of SEQ ID NO: 2, wherein said polyerase may 
further comprise an isoleucine to valine mutation at a site corresponding to 1630 of SEQ ID NO: 
2. 

29. The polymerase of claim 16, wherein said DNA polymerase comprises a glutamic acid to 
lysine mutation at a site corresponding to E645 of SEQ ID NO: 2. 

30. The polymerase of claim 16, wherein said DNA polymerase comprises a glutamic acid 
to lysine mutation at a site corresponding to E578 of SEQ ID NO: 2, wherein said polymerase 
may further comprise an arginine to methionine mutation at a site corresponding to R465 of SEQ 
ID NO: 2. 

3 1 . The polymerase of claim 1 6, wherein said DNA polymerase comprises a leucine to 
glutamine mutation at a site corresponding to L396 of SEQ ID NO: 2, wherein said polymerase 
further comprises a mutation at a site corresponding to V401, N424 5 P569, E617, or V640 of 
SEQ ID NO: 2. 

^ 32. The polymerase of claim 1 6, wherein said DNA polymerase comprises a serine to 
S asparagene mutation at a site corresponding to S651 of SEQ ID NO: 2. 

fi 33. The polymerase of claim 16, wherein said DNA polymerase comprises a leucine to 

proline mutation at a site corresponding to L396 of SEQ ID NO: 2, wherein said polymerase may 
z further comprise a mutation at a site corresponding to E459 of SEQ ID NO: 2. 

34. The polymerase of claim 16, wherein said DNA polymerase comprises a leucine to 
proline mutation at a site corresponding to L456 of SEQ ID NO: 2, wherein said polymerase may 
3 further comprise a mutation at a site corresponding to E658 of SEQ ID NO: 2. 

I 35. The polymerase of claim 1 6, wherein said DNA polymerase comprises a leucine to 
3 histidine mutation at a site corresponding to L408 of SEQ ID NO: 2, wherein said polymerase 
sl may further comprise a mutation at a site corresponding to V437, or L478 of SEQ ID NO: 2. 

36. The polymerase of claim 16 wherein said DNA polymerase comprises an tyrosine to 
asparagine mutation at a site corresponding to Y496 of SEQ ID NO: 2. 

37. The polymerase of claim 16 wherein said DNA polymerase comprises an alanine to 
threonine mutation at a site corresponding to A485 of SEQ ID NO: 2. 

38. The polymerase of claim 37 comprising a leucine to histidine mutation at a site 
corresponding to L408 of SEQ ID NO: 2, 

39. The polymerase of claim 37 comprising a leucine to phenylalanine mutation at a site 
corresponding to L408 of SEQ ID NO: 2. 

40. The polymerase of claim 37 comprising a proline to leucine mutation at a site 
corresponding to P410 of SEQ ID NO: 2. 
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41 . The polymerase of claim 1 6, having reduced discrimination against a non-conventional 
nucleotide selected from the group consisting of: dideoxynucleotides, ribonucleotides and 
conjugated nucleotides. 

42. The polymerase of claim 41 wherein said conjugated nucleotide is selected from the 
group consisting of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled 
nucleotides, chemiluminescently labeled nucleotides and quantum dot labeled nucleotides. 

43. The polymerase of claim 10 or 16 further comprising a mutation at an amino acid 
residue in the polymerase that corresponds to a mutation selected from the group consisting of: a 
Y to V mutation at amino acid 409 of SEQ ID NO:2; an A to C, S, L, I, F, or V mutation at 
amino acid 485 of SEQ ID NO: 2; a Y to S mutation at amino acid 494 of SEQ ID NO: 2; a Y to 
L mutation at amino acid 496 of SEQ ID NO: 2; and an A to Y mutation at amino acid 490 of 
SEQ ID NO: 2. 

44. The polymerase of claim 10 or 16 further comprising a mutation at an amino acid of the 
polymerase corresponding to one of amino acids 483 to 496, inclusive, of SEQ ID NO: 2. 

45. The polymerase of claim 44 wherein said mutation is at an amino acid of the 
polymerase corresponding to one of amino acids 485, 490, 494, or 496 of SEQ ID NO: 2. 

46. An isolated recombinant Family B DNA polymerase comprising an alanine to threonine 
mutation at an amino acid corresponding to A485T of SEQ ID NO: 2 and and at least one 
substitution in the polymerase of an amino acid corresponding to L408, Y409, S345 or P410 
respectively, of SEQ ID NO: 2. 

47. An isolated recombinant Family B DNA polymerase comprising an amino acid other 
than A at an amino acid of the polymerase corresponding to A485 of SEQ ID NO: 2, and at least 
one substitution in the polymerase of an amino acid corresponding to L408, Y409, S345 or P410, 
respectively, of SEQ ID NO: 2. 

48. A recombinant vector comprising a nucleic acid sequence encoding the DNA 
polymerase of any one of claims 1, 5, 6, 10, 16, 46 or 47. 

49. A method of labeling a complementary strand of DNA, which method comprises the 
step of: 

contacting a template DNA molecule with: 

a recombinant Family B DNA polymerase of claim 10; and 
a non-conventional nucleotide under conditions and for a time sufficient to permit 
said DNA polymerase to synthesize a complementary DNA strand and to incorporate said non- 
conventional nucleotide into said complementary DNA strand. 

50. A method of labeling a complementary strand of DNA, which method comprises the 
step of: 

contacting a template DNA molecule with: 
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a recombinant Family B DNA polymerase comprising an alanine to threonine 
mutation at a site corresponding to A485 of SEQ ID NO: 2 or a mutation at a site corresponding 
to L408 or P410 or S345 of SEQ ID NO: 2, wherein said DNA polymerase has reduced 
discrimination against non-conventional nucleotides; and 

a non-conventional nucleotide under conditions and for a time sufficient to permit 
said DNA polymerase to synthesize a complementary DNA strand and to incorporate said non- 
conventional nucleotide into said complementary DNA strand. 

5 1 . The method of claim 50 wherein said recombinant Family B DNA polymerase is 3 ? -5' 
exonuclease deficient. 

52. The method of any one of claims 49-51 herein said recombinant Family B polymerase 
comprises a leucine to histidine mutation at a site corresponding to amino acid L408 of SEQ ID 
NO: 2. 

53 . The method of any one of claims 49-5 1 wherein said recombinant Family B polymerase 
comprises a leucine to phenylalanine mutation at a site corresponding to amino acid L408 of 
SEQ ID NO: 2. 

54. The method of any one of claims 49-51 wherein said recombinant Family B polymerase 
comprises a proline to leucine mutation at a site corresponding to amino acid P410 of SEQ ID 
NO: 2. 

55. The method of any one of claims 49-51 wherein said recombinant Family B polymerase 
comprises an alanine to threonine mutation at a site corresponding to amino acid A485 of SEQ 
ID NO: 2. 

56. The method of claim 55 wherein said recombinant Family B polymerase comprises a 
leucine to histidine mutation at an amino acid corresponding to L408 of SEQ ID NO: 2. 

57. The method of claim 55 wherein said recombinant Family B polymerase comprises a 
leucine to phenylalanine mutation at an amino acid corresponding to L408 of SEQ ID NO: 2. 

58. The method of claim 55 wherein said recombinant Family B polymerase comprises a 
proline to leucine mutation at an amino acid corresponding to P410 of SEQ ID NO: 2. 

59. The method of claim 55 wherein said recombinant Family B polymerase comprises a 
serine to proline mutation at an amino acid corresponding to S3 45 of SEQ ID NO: 2. 

60. The method of claim 55 , wherein said recombinant Family B polymerase has a dual 
mutation comprising a serine to proline mutation at a site corresponding to S345 of SEQ ID NO: 
2; and a proline to leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

61 . The method of any one of claims 49-5 1 wherein said recombinant Family B polymerase 
has reduced discrimination against a non-conventional nucleotide selected from the group 
consisting of: dideoxynucleotides, ribonucleotides, and conjugated nucleotides. 
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62. The method of claim 61 wherein said conjugated nucleotide is selected from the group 
consisting of radiolabeled nucleotides, fluorescently labeled nucleotides, biotin labeled 
nucleotides, chemiluminescently labeled nucleotides and quantum dot labeled nucleotides. 

63. A method of sequencing DNA comprising the steps of: 
contacting a template DNA strand with: 

a sequencing primer; 

a recombinant Family B DNA polymerase of claim 9; and 

a chain-terminating nucleotide analog, under conditions that permit said DNA 
polymerase to synthesize a complementary DNA strand and to incorporate nucleotides into the 
synthesized complementary DNA strand, wherein incorporation of a chain-terminating 
nucleotide analog results in the termination of chain elongation, such that the nucleotide 
sequence of said DNA strand is determined. 

64. A method of sequencing DNA comprising the steps of: 
contacting a DNA strand with: 

a sequencing primer; 

a recombinant Family B DNA polymerase comprising an alanine to threonine 
mutation at a site corresponding to A485 of SEQ ID NO: 2 or a mutation at a site corresponding 
to L408 or P410 or S345 of SEQ ID NO: 2, wherein said DNA polymerase has reduced 
discrimination against non-conventional nucleotides; and 

a chain-terminating nucleotide analog, under conditions that permit said DNA 
polymerase to synthesize a complementary DNA strand and to incorporate nucleotides into the 
synthesized complementary DNA strand, wherein incorporation of a chain-terminating 
nucleotide analog results in the termination of chain elongation, such that the nucleotide 
sequence of said DNA strand is determined, 

65. The method of claim 64 wherein said recombinant Family B polymerase is deficient in 
3' to 5' exonuclease activity. 

66. The method of any one of claims 63-65 wherein said recombinant Family B polymerase 
has a leucine to histidine mutation at a site corresponding to amino acid L408 of SEQ ID NO: 2. 

67. The method of any one of claims 63-65 wherein said recombinant Family B polymerase 
has a leucine to phenylalanine mutation at a site corresponding to amino acid L408 of SEQ ID 
NO: 2. 

68. The method of any one of claims 63-65 wherein said recombinant Family B polymerase 
has a proline to leucine mutation at a site corresponding to amino acid P410 of SEQ ID NO: 2. 
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69. The method of any one of claims 63-65 wherein said recombinant Family B polymerase 
has an alanine to threonine mutation at a site corresponding to amino acid A485 of SEQ ID NO: 
2. 

70. The method of claim 69 wherein said recombinant Family B polymerase has a leucine to 
histidine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

71 . The method of claim 69 wherein said recombinant Family B polymerase has a leucine to 
phenylalanine mutation at a site corresponding to L408 of SEQ ID NO: 2. 

72. The method of claim 69 wherein said recombinant Family B polymerase has a proline to 
leucine mutation at a site corresponding to P410 of SEQ ID NO: 2. 

73. The method of claim 69 wherein said recombinant Family B polymerase has a serine to 
proline mutation at a site corresponding to S345 of SEQ ID NO: 2. 

74. The method of any one of claims 63-65 wherein said chain-terminating nucleotide 
analog is a dideoxynucleotide. 

75. The method of claim 74 wherein said dideoxynucleotide is detectably labeled. 

76. The method of claim 75 wherein said dideoxynucleotide is fluorescently labeled. 

77. The method of claim 76 wherein said dideoxynucleotide is labeled with a moiety 
selected from the group consisting of fluorescein and rhodamine. 

78. A kit for performing the method of claim 49, 50, 5 1 , 63, or 64. 

79. A method of making a purified thermostable DNA polymerase having an amino acid 
sequence presented in SEQ ID NO: 2 from residue 1 to 776, comprising culturing a host cell 
containing the nucleic acid sequence presented in SEQ ID NO:l under conditions which permit 
production of said DNA polymerase. 

80. A method of making a recombinant DNA polymerase of Thermococcus species JDF-3 
that is 3' to 5' exonuclease deficient, comprising culturing a host cell containing a nucleic acid 
sequence encoding said polymerase under conditions which permit production of said DNA 
polymerase. 

81. A method of making a recombinant DNA polymerase having reduced discrimination 
against non-conventional nucleotides, comprising culturing a host cell containing a nucleic acid 
sequence encoding said polymerase under conditions which permit production of said DNA 
polymerase. 

82. A method of making a recombinant Family B DNA polymerase comprising an alanine to 
threonine mutation at the site corresponding to A485 of SEQ ID NO: 2 or a mutation at a site 
corresponding to L408 or P410 or S345 of SEQ ID NO: 2, wherein said DNA polymerase has 
reduced discrimination against non-conventional nucleotides relative to the wild-type form of 



that polymerase, comprising culturing a host cell containing a nucleic acid sequence encoding 
said polymerase under conditions which permit production of said DNA polymerase. 

83. The method of claim 79, 80, 81, or 82, wherein said host cell is E. Colt 

84. The method of claim 79, 80, 81, or 82, wherein said host cell is Thermococcus. 
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FIG. I. 

JDF-3 DNA polymerase nucleotide sequence; 2331 nucleotides (SEQ ID NO: 1) 

ATGATCCTTGACGTTGATTACATCACCGAGAATGGAAAGCCCGTCATCAGGGTCTTCAAGAAGGAGAACGG 
CGAGTTCAGGATTGAATACGACCGCGAGTTCGAGCCCTACTTCTACGCGCTCCTCAGGGACGACTCTGCCA 
TCGAAGAAATCAAAAAGATAACCGCGGAGAGGCACGGCAGGGTCGTTAAGGTTAAGCGCGCGGAGAAGGTG 
AAGAAAAAGTTCCTCGGCAGGTCTGTGGAGGTCTGGGTCCTCTACTTCACGCACCCGCAGGACGTTCCGGC 
AATCCGCGACAAAATAAGGAAGCACCCCGCGGTCATCGACATCTACGAGTACGACATACCCTTCGCCAAGC 
GCTACCTCATAGACAAGGGCCTAATCCCGATGGAAGGTGAGGAAGAGCTTAAACTCATGTCCTTCGACATC 
GAGACGCTCTACCACGAGGGAGAAGAGTTTGGAACCGGGCCGATTCTGATGATAAGCTACGCCGATGAAAG 
CGAGGCGCGCGTGATAACCTGGAAGAAGATCGACCTTCCTTACGTTGAGGTTGTCTCCACCGAGAAGGAGA 
TGATTAAGCGCTTCTTGAGGGTCGTTAAGGAGAAGGACCCGGACGTGCTGATAACATACAACGGCGACAAC 
TTCGACTTCGCCTACCTGAAAAAGCGCTGTGAGAAGCTTGGCGTGAGCTTTACCCTCGGGAGGGACGGGAG 
CGAGCCGAAGATACAGCGCATGGGGGACAGGTTTGCGGTCGAGGTGAAGGGCAGGGTACACTTCGACCTTT 
ATCCAGTCATAAGGCGCACCATAAACCTCCCGACCTACACCCTTGAGGCTGTATACGAGGCGGTTTTCGGC 
AAGCCCAAGGAGAAGGTCTACGCCGAGGAGATAGCCACCGCCTGGGAGACCGGCGAGGGGCTTGAGAGGGT 
CGCGCGCTACTCGATGGAGGACGCGAGGGTTACCTACGAGCTTGGCAGGGAGTTCTTCCCGATGGAGGCCC 
AGCTTTCCAGGCTCATCGGCCAAGGCCTCTGGGACGTTTCCCGCTCCAGCACCGGCAACCTCGTCGAGTGG 
TTCCTCCTAAGGAAGGCCTACGAGAGGAACGAACTCGCTCCCAACAAGCCCGACGAGAGGGAGCTGGCGAG 
GAGAAGGGGGGGCTACgcCGGTGGCTACGTCAAGGAGCCGGAGCGGGGACTGTGGGACAATATCGTGTATC 
TAGACTTTCGTAGTCTCTACCCTTCAATCATAATCACCCACAACGTCTCGCCAGATACGCTCAACCGCGAG 
GGGTGTAGGAGCTACGACGTTGCCCCCGAGGTCGGTCACAAGTTCTGCAAGGACTTCCCCGGCTTCATTCC 
GAGCCTGCTCGGAAACCTGCTGGAGGAAAGGCAGAAGATAAAGAGGAAGATGAAGGCAACTCTCGACCCGC 
TGGAGAAGAATCTCCTCGATTACAGGCAACGCGCCATCAAGATTCTCGCCAACAGCTACTACGGCTACTAC 
GGCTATGCCAGGGCAAGATGGTACTGCAGGGAGTGCGCCGAGAGCGTTACGGCATGGGGAAGGGAGTACAT 
CGAAATGGTCATCAGAGAGCTTGAGGAAAAGTTCGGTTTTAAAGTCCTCTATGCAGACACAGACGGTCTCC 
ATGCCACCATTCCTGGAGCGGACGCTGAAACAGTCAAGAAAAAGGCAATGGAGTTCTTAAACTATATCAAT 
CCCAAACTGCCCGGCCTTCTCGAACTCGAATACGAGGGCTTCTACGTCAGGGGCTTCTTCGTCACGAAGAA 
AAAGTACGCGGTCATCGACGAGGAGGGCAAGATAACCACGCGCGGGCTTGAGATAGTCAGGCGCGACTGGA 
GCGAGATAGCGAAGGAGACGCAGGCGAGGGTTTTGGAGGCGATACTCAGGCACGGTGACGTTGAAGAGGCC 
GTCAGAATTGTCAGGGAAGTCACCGAAAAGCTGAGCAAGTACGAGGTTCCGCCGGAGAAGCTGGTTATCCA 
CGAGCAGATAACGCGCGAGCTCAAGGACTACAAGGCCACCGGCCCGCACGTAGCCATAGCGAAgcGTTTGG 
CCGCCAGAGGTGTTAAAATCCGGCCCGGAACTGTGATAAGCTACATCGTTCTGAAGGGCTCCGGAAGGATA 
GGCGACAGGGCGATTCCCTTCGACGAGTTCGACCCGACGAAGCACAAGTACGATGCGGACTACTACATCGA 
GAACCAGGTTCTGCCGGCAGTTGAGAGAATCCTCAGGGCCTTCGGCTACCGCAAGGAAGACCTGCGCTACC 
AGAAGACGAGGCAGGTCGGGCTTGGCGCGTGGCTGAAGCCGAAGGGGAAGAAGAAGTGA 
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FIG. 2. 

JDF-3 DNA polymerase amino acid sequence (SEQ ID NO: 2) 
Theoretical molecular weight: 90.3 kD 



MILDVDYITENGKPVIRVFKKENGEFRIEYDREFEPYFYALLROT 

KKKFLGRSVEVWVLYFTHPQDVPAIRDKIRKH 

ETLYHEGEEFGTGPILMISYADESEARVITWKKIDLPYV^^ 

FDFAYLKKRCEKLGVS FTLGRDGS E PKIQRMGDRFAVEVKGRVHFDLYPVI RRT INLPTYTLEAVYEAVFG 
KPKEKVYAEEIATAWETGEGLERVARYSMEDARVTYELGREFFPM^ 

FLLRKAYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPS 1 1 ITHNVSPDTLNRE 

GCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLDYRQRAIKII^ 

GYARARWCRECAESVTAWGREYIEMVIRELEEKFGFKV^ 

PKLPGLLELEYEGFYVRGFFVTKKiCYAVIDEEGKITTRGL^^ 

VRIVREVTEKLSKYEVPPEKLVIHEQITRELKDYKATGPHVAIAKRLA^ 

GDRAI P FDE FDPTKHKYDADYYI ENQVLP AVERI LRAFGYRK^ 



FIG. 3. 

JDF-3 DNA polymerase with intein sequence (SEQ ID NO: 3) 

MILDVDYITENGKPVIRVFKKENGEFRIEYDREPEPYFYALLRDDSAIEE 

I KKI TAERHGRVVKVKRAEKVKKKFLGRSVEVWLYFTHPQDVPAI RDKI 

RKHPAVID I YE YD I PFAKRYL IDKGL I PMEGEEELKLMS FDI ETLYHEGE 

EFGTGP ILMI S YADESEARVI TWKKIDLP YVEWSTEKEMI KRFLRWKE 

KDPDVL I TYNGDNFDFAYLKKRCEKLGV S FTLGRDGSEPKI QRMGDRFAV Extein 1 

EVKGRVH FDLYP V I RRT I NLP TYTLEAVYEAVFGKPKE KVYAE E I AT AWE 

TGEGLERVARYSMEDARVTYELGREFFPMEAQLSRLIGQGLWDVSRSSTG 

NLVEWFLLRKAYERI^LAPNKPDERELARRRGGYAGGYVKEPERGIjWDNI 

VYLDFRS LYPS III THNVS PDTLNREGCRS YDVAPEVGHKFCKDFPGFI P 

S LLGNLLEERQKI KRKMKATLDPLEKNLLD YRQRAI KI LAN 

SLLPGEWVA 

VIEGGKLRPWIGELVDGLMEASGERVKRDGDTEVLEVEGLYASPSTGSP 

RKPAQCR*KP* *GTAMPGKFTE*LSTPEGGLSVTRGHSLFAYRDASLWR* * 

RGRRRFKPGDLLAVPSG*PSRRGGRGSTSLNCSSNCPRRKRPTCHRHSGK 

GRKNFFRGMLRTLRWIFGEEKTGGRPGATWSTLRGLGYVKLRKIGYGWD 

REGLGKVPRFYERLVEVIRYNGNRGEFIADFNAIiRPVLRIiMMPEKELEEW Intein 1 

LVGTRNGFR IRP FI EVDWKFAKLLGYYVS EGS AGKWKNRTGGWS YS VRLY 

NEDGSVLDDMERIaARSSLGA*ARGELRRDFKEDGLHNLRGALRFTGREQE 

GSVAYIiIVP*GGPLGLP*GVLHRRRRRSPEQDGSALHQERASG*RPRPAP 

ELAGRLSDKRPPRQRGLQGLRERGTALYRVPEAEERLTYSHVIPREVLEE 

TSAGPSRRT*VTGNSGSWWKAGSSTRKGPVG*AGSSTGI*SSTGSRKSGR 

KATRGTS TT * ALRRTRTSGGLWVPLRAQX 

S YYGYYGYARARWYCRE CAES 
VTAWGRE YI EMVIRELEEKFGFKVLYADTDGLHAT I PGADAETVKKKAME 
FIiNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEEGKI TTRGLE I VR 
RDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKLVI Extein 2 
HEQITRELKDYKATGPHVAIAKRLAARGVKIRPGTVISYIVLKGSGRIGD 
RAIPFDEFDPTKHKYDADYYIENQVLPAVERILRAFGYRKEDLRYQKTRQ 
VGLGAWLKPKGKKK 



FIG. 4 

JDF-3 DNA polymerase genomic sequence (SEQ ID NO: 4) 

AATTCCACTGCCGTGTTTAACCTTTCCACCGTTGAACTTGAGGGTGATTT 
TCTGAGCCTCCTCAATCACTTAATCGAGACCGCGGATTACCTTGAACTGG 
TACACGTTCAACGATTCGGTTCTTGTAATGGTCGATACTGGGCCGTGCTG 

GATTTTCTAAACGTCTCAAGAACGGCTTTCATCAACGGAAACTGCCACGT 5' untranslated sequence 

CTCCGCCGTCGTGAGGGTTAAACCTGAAGTTCAAGACTTTGCAACGGAAT 

GGCGAGAGAACGGCGACTACCCCAGTGGAAGAGCTTTTGAAAGCCAAAGC 

CGAGCTTCAGCGAATGTGCGGTGCCCTTGTTCAAGAGTTGTGAGCCCTTG 

ATTGTTGTTTTCTCCTCTTTTCTGATAACATCGATGGCGAAGTTTATTAG 

TTCTCAGTTCGATAATCAGGCAGGTGTTGGTC 

ATGATCCTTGACGTTGAT 
TACATCACCGAGAATGGAAAGCCCGTCATCAGGGTCTTCAAGAAGGAGAA 
CGGCGAGTTCAGGATTGAATACGACCGCGAGTTCGAGCCCTACTTCTACG 
CGCTCCTCAGGGACGACTCTGCCATCGAAGAAATCAAAAAGATAACCGCG 
GAGAGGCACGGCAGGGTCGTTAAGGTTAAGCGCGCGGAGAAGGTGAAGAA 
AAAGTTCCTCGGCAGGTCTGTGGAGGTCTGGGTCCTCTACTTCACGCACC 
CGCAGGACGTTCCGGCAATCCGCGACAAAATAAGGAAGCACCCCGCGGTC 
ATCGACATCTACGAGTACGACATACCCTTCGCCAAGCGCTACCTCATAGA 
CAAGGGCCTAATCCCGATGGAAGGTGAGGAAGAGCTTAAACTCATGTCCT 
TCGACATCGAGACGCTCTACCACGAGGGAGAAGAGTTTGGAACCGGGCCG 
ATTCTGATGATAAGCTACGCCGATGAAAGCGAGGCGCGCGTGATAACCTG 
GAAGAAGATCGACCTTCCTTACGTTGAGGTTGTCTCCACCGAGAAGGAGA 
TGATTAAGCGCTTCTTGAGGGTCGTTAAGGAGAAGGACCCGGACGTGCTG 
ATAACATACAACGGCGACAACTTCGACTTCGCCTACCTGAAAAAGCGCTG 
TGAGAAGCTTGGCGTGAGCTTTACCCTCGGGAGGGACGGGAGCGAGCCGA Extein 1 
AGATACAGCGCATGGGGGACAGGTTTGCGGTCGAGGTGAAGGGCAGGGTA 
CACTTCGACCTTTATCCAGTCATAAGGCGCACCATAAACCTCCCGACCTA 
CACCCTTGAGGCTGTATACGAGGCGGTTTTCGGCAAGCCCAAGGAGAAGG 
TCTACGCCGAGGAGATAGCCACCGCCTGGGAGACCGGCGAGGGGCTTGAG 
AGGGTCGCGCGCTACTCGATGGAGGACGCGAGGGTTACCTACGAGCTTGG 
CAGGGAGTTCTTCCCGATGGAGGCCCAGCTTTCCAGGCTCATCGGCCAAG 
GCCTCTGGGACGTTTCCCGCTCCAGCACCGGCAACCTCGTCGAGTGGTTC 
CTCCTAAGGAAGGCCTACGAGAGGAACGAACTCGCTCCCAACAAGCCCGA 
CGAGAGGGAGCTGGCGAGGAGAAGGGGGGGCTACGCCGGTGGCTACGTCA 
AGGAGCCGGAGCGGGGACTGTGGGACAATATCGTGTATCTAGACTTTCGT 
AGTCTCTACCCTTCAATCATAATCACCCACAACGTCTCGCCAGATACGCT 
CAACCGCGAGGGGTGTAGGAGCTACGACGTTGC CCC CGAGGTCGGTCACA 
AGTTCTGCAAGGACTTCCCCGGCTTCATTCCGAGCCTGCTCGGAAACCTG 
CTGGAGGAAAGGCAGAAGATAAAGAGGAAGATGAAGGCAACTCTCGACCC 
GCTGGAGAAGAATCTCCTCGATTACAGGCAACGCGCCATCAAGATTCTCG 
CCAAC 

AGCCTTCTTCCCGGGGAGTGGGTTGCGGTCATTGAAGGGGGGAAA 
CTCAGGCCCGTCCGCATCGGCGAGCTGGTTGATGGACTGATGGAAGCCAG 
CGGGGAGAGGGTGAAAAGAGACGGCGACACCGAGGTCCTTGAAGTCGAGG 
GGCTTTACGCCTCTCCTTCGACAGGGAGTCCAAGAAAGCCCGCACAATGC 
CGGTGAAAGCCGTGATAAGGCACCGCTATGCCGGGGAAGTTTACAGAATA 
GCTCTCAACTCCGGAAGGAGGATTAAGCGTGACGCGCGGCCACAGCCTCT Intein 1 
TCGCGTACCGGGACGCGAGCTTGTGGAGGTGACGGGGGAGGAGGAGGTTC 
AAGCCCGGCGACCTCCTGGCGGTGCCAAGCGGATAACCCTCCCGGAGAGG 
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AGGGAGAGGCTCAACATCGTTGAACTGCTCCTCGAACTffCCCGAGGAGGA 

AACGGCCGACATGTCATCGACATTCCGGCAAGGGTAGAAAGAACTTCTTC 

AGGGGAATGCTCAGAACCCTCCGCTGGATTTTCGGGGAGGAGAAGACCGG Illtein 1 

AGGGCGGCCAGGCGCTACCTGGAGCACCTTGCGTGGGCTCGGCTACGTGA 

AGCTGAGGAAAATCGGCTACGGGGTGGTTGATAGGGAGGGACTGGGAAAG 

GTACCGCGCTTCTACGAGAGGCTCGTGGAGGTAATCCGCTACAACGGCAA 

CAGGGGGGAGTTCATCGCCGATTTCAACGCGCTCCGCCCCGTCCTCCGCC 

TGATGATGCCCGAGAAGGAGCTTGAAGAGTGGCTCGTTGGGACGAGGAAC 

GGGTTCAGGATAAGGCCGTTCATAGAGGTTGATTGGAAGTTCGCAAAGCT 

CCTCGGCTACTACGTGAGCGAGGGGAGCGCCGGGAAGTGGAAAAACCGGA 

CCGGGGGCTGGAGCTACTCGGTGAGGCTTTACAACGAGGACGGGAGCGTT 

CTCGACGACATGGAGAGACTCGCGAGGAGTTCTTTGGGGGCGTGAGCGCG 

GGGGGAACTACGTCGAGATTTCAAAGAAGATGGCCTACATAATCTTCGAG 

GGGCTCTGCGGTTCACCGGCCGAGAACAAGAGGGTTCCGTGGCTTATCTT 

CACGTCCCCTGAGGAGGTCCGCTGGGCCTTCCTTGAGGGGTACTTCATCG 

GCGACGGCGACGTTCACCCGAGCAAGATGGTTCGGCTCTCCACCAAGAGC 

GAGCTTCTGGCTAACGGCCTCGTCCTGCTCCTGAACTCGCTGGGCGTCTC 

AGCGATAAACGTCCGCCACGACAGCGGGGTTTACAGGGTCTACGTGAACG 

AGGAACTGCCCTTTACAGAGTACCGGAAGCGGAAGAACGCCTCACTTACT 

CCCACGTCATACCGAGGGAAGTGCTGGAGGAGACTTCGGCCGGGCCTTCC 

AGAAGAACATGAGTCACGGGAAATTCAGGGAGCTGGTGGAAAGCGGGGAG 

CTCGACGCGGAAAGGGCCGGTAGGATAGGCTGGCTCCTCGACGGGGATAT 

AGTCCTCGACAGGGTCTCGGAAGTCAGGAAGGAAAGCTACGAGGGGTACG 

TCTACGACCTGAGCGTTGAGGAGGACGAGAACTTCTGGCGGGCTTTGGGT 

TCCTCTACGCGCACAACNN 



CAAGATGGTACTGCAGGGAGTGCGCCGAGAGCGTTACGGCATGGGGAAGG 

GAGTACATCGAAATGGTCATCAGAGAGCTTGAGGAAAAGTTCGGTTTTAA 

AGTCCTCTATGCAGACACAGACGGTCTCCATGCCACCATTCCTGGAGCGG 

ACGCTGAAACAGTCAAGAAAAAGGCAATGGAGTTCTTAAACTATATCAAT 

CCCAAACTGCCCGGCCTTCTCGAACTCGAATACGAGGGCTTCTACGTCAG 

GGGCTTCTTCGTCACGAAGAAAAAGTACGCGGTCATCGACGAGGAGGGCA 

AGATAACCACGCGCGGGCTTGAGATAGTCAGGCGCGACTGGAGCGAGATA 

GCGAAGGAGACGCAGGCGAGGGTTTTGGAGGCGATACTCAGGCACGGTGA Extein 2 

CGTTGAAGAGGCCGTCAGAATTGTCAGGGAAGTCACCGAAAAGCTGAGCA 

AGTACGAGGTTC CGC CGGAGAAGCTGGTTATCCACGAGCAGATAACGCGC 

GAGCTCAAGGACTACAAGGCCACCGGCCCGCACGTAGCCATAGCGAAGCG 

TTTGGCCGCCAGAGGTGTTAAAATCCGGCCCGGAACTGTGATAAGCTACA 

TCGTTCTGAAGGGCTCCGGAAGGATAGGCGACAGGGCGATTCCCTTCGAC 

GAGTTCGACCCGACGAAGCACAAGTACGATGCGGACTACTACATCGAGAA 

CCAGGTTCTGCCGGCAGTTGAGAGAATCCTCAGGGCCTTCGGCTACCGCA 

AGGAAGACCTGCGCTACCAGAAGACGAGGCAGGTCGGGCTTGGCGCGTGG 

CTGAAGCCGAAGGGGAAGAAGAAGTGA 



AGCATTAAATGCTTCCGACATTGCCTTATTTATGAAACTCCTGTTGTGCC 
TGAGTTTGTGCCAGAAAACAGCCTGTTCTGACGGCGCTTTTTCTTGCCAG 
GTCTCTTGAGTTTCGCAAGGGTCTTCTCGACCAGCTCAATGGTCTTGTCG 
TGATTGTTTNNNNNNNNNNNNN^ 

GGTAATAGACAGGGATTCCTTCCTCAAGGACTTCCCGGGAGGCATTGGAG 
TTTTTTGGTGGGGCTTTCACAGGATTTGCTCATCTTGTGGATTTCTCGTT 
CGATTGAATCTGTCCACTTGAGGGTGTAGGTCGAGACGGTGGAGCGCGTA 



AGCTACTACGGCTACTACGGCTATGCCAGGG 



GGAATTATCTGGTTTCTTTTCCC 





TTCCGGGAGCGGGTCTTGAGGCTCCATTTTTCAGTCCTCCTCCGGCGAAG 3* Untranslated sequence 

AAGTGGAACTCAAGCCGGGTGTTAGCTTATGTTATGTTCCCAACTCCTCC 

AGCACCTCCAGGATCCCCTCAATCCCGGAACCTCGAAGCCCCTCTCGTGG 

ATCTTTCTAACTTCCTCTGCCTCCGGGTTTATCCAGACCGCCCACATGCC 

GGCTCTCAGCGCACCCTCGAAATCCTCCGCGTAGGTGTCGCCGATGTGGA 

TTGCCTCGTCCGGCTCGACCCCGAAGCATCGAGCGGTTTTCTGAACATCT 

CGGGCATCGGCTTATACGCCAGAACCTCGTCGGCGAAGAAGGTTCCCTCA 

ATGTAGTCCATCAGGCCGAACCTCTCCjAGGGGGGGCCCGGTACCCAATTC 

GCCCTATAGTGAGTCGATTACAATTCACTGGCCGTCGTTTTACAACGTCG 

TGACTGGGAAAACCCTGGCGTTACCCAACTTAAGTCGCTTTGCAGCACAT 

CCCCC 
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Sequencing with Purified Mutants 
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Sequencing with Dye-labeled Dideoxynucleotides 
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Sequencing with the P410L, A485T Double Mutant 
and <x- 33 P Dideoxynucleotides 
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ayerneiapnkpderelarrrggyac^ 

ayernelapnkpderelarrrggyaggyvkeperglwdnivyldfrslypsiiithHvsp 

AYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNI VYLDFRSLYPSI I ITHNVSP 
AYERNEIAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPSIIITHNVSP 
AYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPSIIITHNVSP 

ayernelapnkpderelarrrggyaggyvkepergSwdni|yldfrslypsi I ITHNVSP 

AYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDMI VYLDFRSLYPSI I ITHNVSP 
AYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPSIIITHNVSP 
AYERNELAPNKPDERELARRRGGYAGGYVKEPERGgWDNIVYLDFRSLYPSIIITHNVSP 
AYERN^LAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPSII ITHNVSP 
AYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPSIIITHNVSP 
AYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRS^YPSII ITHNVSP 
AYERNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPSIIITHNVSP 
AYERNELAPNKPDEREIiARRRGGYAGGYVKEPERGLWDNI VYLDFRSLYPS 1 1 ITHNVR P 



DTLmEGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATlSpLESlS 
DTL|REGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 
DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEICNLLD 
DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKII^KMKATLDPLEKNLLD 
DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 
DTLjgREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 
DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 
DTLNREGCRSYJVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 

dtlijregcrsy^apevghkfckdfpgfipsllgnlleSrqkikrkmkatldpleknlld 

DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 
DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNgLEERQKIKRKMKATLDPLEKNLLD 

dtlnregcrsydvapeSghkfckdfpgfipsllgnlleerqkikrkmkatldpleknBld 

DTLNREGCRSYDVAPEVGHKFCKDFPGFIPSLLGNLLEERQKIKRKMKATLDPLEKNLLD 

Idtlnregcrsydvapevghkfckdfpgfipsllgnlleerqkikrkmkatldpleknlld 



YRQRAIKILANSYYGYgGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAI KI LANS YYGYYGYARARWYCRECAE S VTAWGRE YI EMV IRE LEEKFG FKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAI KI LANS YYGYYGYARARWYCRE CAES VTAWGRE Y I EMV I RE LEE KFGFKVLYAD 
YRQRAIKILAN^YYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAI KI LANS YYGYYGYARARWYCRECAE S VTAWGRE Y I EMVI RELE EKFGFKVL YAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAI K I LANS YYGYYGYARARWYCRECAE S VTAWGRE Y I EMV I RELE E KFGFKVLYAD 
YRQRAI KI LANS YYGYYGYARARWYCRECAE S VTAWGRE Y I EMV I RELE EKFGFKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKILANSYYGjJjYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 
YRQRAIKILANSYYGYYGYARARWYCRECAESVTAWGREYIEMVIRELEEKFGFKVLYAD 



TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELgYEGFYWGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINgKLPGLLELEYEGFYVRGFFVTKKKgAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLEgEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 
TDGLHATIPGADAETVKKKAMEFLNYINPKLPGLLELEYEGFYVRGFFVTKKKYAVIDEE 



GKITTRGLEIVRRDWSEIAKETQARVLEAJLRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKI TTRGLE I VRRDWS E I AKETQARVLEAI LRHGDVEEAVR I VREVTEKL S KYE VP PE^L 
GKI TTRGLE I VRRDWS E I AKETQARVLEAI LRHGDVEEAVR I VR^VTEKL S KYE VP PEKL 
GKITTRGLE I VRRDWS E I AKETQARVLEAI LRHGDVEEAVRI VREVTEKLS KYEVPPEKL 
GKITTRGLE I VRRDWS E I AKETQARVLEAI LRHjgDVEE AVRIVREVTEKLS KYEVPPEKL 
GKITTRGLEIVRRDWSglAKETQARVLEAILRHGDVEEA|RIVREVTEKLSKYEVPPEKL 
GKITTRGLE IVRRDWSEIAKETQARVLEAI LRHGDVEEAVR IVREVTEKLS KYEVPPEKL 
GKI TTRGLE I VRRDWS E I AKETQARVLEAI LRHGDVEEAVR I VRE VTEKLgjKYE VP PEKL 
GKITTRGLEIVRRDWSEIAKETQARVLEAILRHGDVEEAVRIVREVTEKLSKYEVPPEKL 
GKI TTRGLE IVRRDWSE I AKETQARVLEAI LRHGDVEEAVRI VREVTEKLSKYEVPPEKL 
GKI TTRGLE I VRRDWSE I AKETQARVLEAI LRHGDVEEAVR I VREVTEKLSKYEVP PjffiCL 
GKITTRGLE IVRRDWSE I AKETQARVLEAI LRHGDVEEAVR I VREVTEKL SKYEVPPgg 
GKITTRGLE IVRRDWSE IAKETQARVLEAI LRHGDVEEAVRI VREVTEKLSKYEVPPEKL 
GKI TTRGLE I VRRDWS EI AKETQARVLEAI LRHGDVEEAVR I VREVTEKLS KYE VP PEKL 
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ATTY. DOCKET NO: 25436/1560 



DECLARATION AND POWER OF ATTORNEY 
FOR A PATENT APPLICATION 



As a below-named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and 
joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a 
patent is sought on the invention entitled 

"COMPOSITIONS AND METHODS UTILIZING DNA POLYMERASES" 

the specification of which: 

3(check one) [X ] is attached hereto. 

[ ] was filed on as 

Application Serial No. 

and was amended on 

(if applicable) 

I hereby state that I have reviewed and understand the contents of the above-identified specification, 
t including the claims, as amended by any amendment referred to above. 

J I do not know and do not believe that the invention was ever patented or described in any printed 
i publication in any country before my or our invention thereof or more than one year prior to this 
t application. 

I do not know and do not believe that the invention was in public use or on sale in the United States of 
America more than one year prior to this application. 

I acknowledge the duty to disclose information which is material to the patentability of this application in 
accordance with Title 37, Code of Federal Regulations, § 1.56(a). 

I hereby claim foreign priority benefits under Title 35, United States Code, § 119 of any foreign 
application(s) for patent or inventor's certificate listed below and have also identified below any foreign 
application for patent or inventor's certificate having a filing date before that of the application on which 
priority is claimed: 

Prior Foreign Application(s) Priority 

Claimed 



60/162.600 
(Number) 



US 

(Country) 



29/10/99 

(Day/Month/Year filed) 



[ ] [ ] 
Yes No 



(Number) 



(Number) 



(Country) 



(Country) 



Day/Month/Year filed) 



Day/Month/Year filed) 



[ ] [ ] 
Yes No 

[ ] [ ] 

Yes No 



I hereby claim the benefit under Title 35, United States Code, § 120 of any United States Patent 
application(s) listed below and, insofar as the subject matter of each of the claims of this application is not 
disclosed in the prior United States application in the manner provided by the first paragraph of Title 35, 
United States Code, § 112, 1 acknowledge the duty to disclose material information as defined m Title 37, 
Code of Federal Regulations, § 1.56(a) which occurred between the filing date of the prior application and 
the national or PCT international filing date of this application: 



(Application Serial No.) 


(Filing Date) 


(Status: patented/pending/abandoned) 


-( Application Serial No.) 


(Filing Date) 


(Status: patented/pending/abandoned) 


5 (Application Serial No.) 


(Filing Date) 


(Status: patented/pending/abandoned) 



as 

HAs a named inventor, I hereby appoint the following attorneys to prosecute this application and transact all 
□business in the Patent and Trademark Office connected therewith: 

^Kathleen M. Williams, Reg. No. 34,380 
E Paula A. Campbell Evans, Reg. No. 32,503 
° Richard B. Smith, Reg. No. 34,020 

Elizabeth N. Spar, Reg. No. 45,123 

Mark J. Fitzgerald, Reg. No. 45,928 

John Garvey, Reg. No. 37,833 



Send Correspondence to: 



Kathleen M. Williams, Esq. 
Palmer & Dodge, LLP 

One Beacon Street, Boston, MA 02108-3190 



Direct telephone calls to: Kathleen M. Williams at telephone: (617) 227-0451 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made 
on information and belief are believed to be true; and further that these statements were made with the 
knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or 
both, under Section 1001 of Title 18 of the United States Code and that such willful false statements may 
jeopardize the validity of the application or any patent issued thereon. 
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Full name of first inventor Joseph A. Sorge 




Fi|#tTfivelfttar's signature L/ Date 

Residence United States 
Citizenship United States 

Post Office Address 5320 Pine Meadow Road, Wilson, WY 83014 



Full name oflftcand inventor Holly Hurlbut Hogrefe 

Second idventor^s signature 
^Residence United States 
Scitizenship United States 

Upost Office Address 6045 Dirac Street, San Diego, CA 92122 



hi 



lo Moo 



Date 



Full name of third inventor// Connie Jo Hansen 
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jThird Inventor's signature 
\ Residence United States 
Citizenship United States 

Post Office Address 3645 Gayle Street, S an Diego, CA 92 1 1 5 



Date 



